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(57) Abstract: A system for the detection of gene activation events is provided which comprises a nucleic acid construct encoding a 
Q protein of the lipocalin protein family and a peptide tag in which the expression of the construct in a cell or in the cells of a transgenic 
^ animal demonstrates the activation of a gene or genes of interest, in which the protein expressed is secreted from the cell and in 
»^ which detection of the peptide tag indicates expression of the construct. 
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MULTI-REPORTER GENE MODEL FOR TOXICOLnrrT CAT. SrRFRNnn^r: 

The present invention relates to a non-invasive reporter gene system for the detection 
of gene activation events related to altered metaboMc status in vivo or in vitro for use 
5 in toxicological screening: 

Genes encode proteins. It is estimated that there at least 3 x 10* genes in the vertebrate 
genome but for a given cell only a subset of the total number of genes is active, with 
the subset differing between cells of different types and between different stages of 

10 development and differentiation (Cho & CampbeU Trends Genet. 16 409-415 (2000); 
Velculescu et al Trends Genet. 16 423-425 (2000)). The DNA regulatoiy elements 
associated with each gene governs the decision as to which genes are active and which 
are not. Although comprising a number of defined elements these DNA sequences are 
collectively termed promoters (Tjian & Maniatis Cell 77 5-8 (1994); Bonifer, Trends 

15 Genet. 16 310-315 (2000); Martin, Trends Genet 17 444-448 (2001)). 

Gene activation occurs primarily at the transcriptional level. Transcriptional activity of 
a gene may be measured by a variety of approaches including RNA polymerase 
activity, mRNA abundance or protein production (Takano et al., 2002). These 
approaches are limited in that they require development of an assay suitable to each 
.'••individual mRNA or protein product To facilitate comparison of different promoters, 
rather than assaying individual gene products, reporter genes are often used (Sun et cd 
Gene Ther. 8 1572-1579 (2001); Franco et al Eur. J. Morphol. 39 169-191 (2001); 
Hadjantonalds & Nagy. Histochetn. Cell. Biol. 115 49-58 (2001); Gorman Mol. Cell 
Biol 2 1044-1051 (1982); Barash and Reichenstein, 2002; Zhang et al., 2001.). 

The product (mRNA or protein) of a reporter gene allows an assessment of the 
transcriptional activity of a particular gene and can be used to distinguish cells, tissues 
or organisms in which the event has occurred ftom those in which it has not On the 
whole reporter genes are foreign to the host cell or organism. aUowmg their activity to 
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be easily distinguished from the activity of endogenous genes. Alternatively the 
reporter may be marked or tagged so as to make it distinct from host genes. 

Reporter genes are linked to the test promoter, enabling activity of the promoter gene 
5 to be determined by detecting the presence of the reporter gene product Therefore, the 
main prerequisite for a reporter gene product is that it is easy to detect and quantify. In 
some cases, but not all, the reporter gene has enzymatic activity that catalyses the 
conversion of a substrate into a measurable product. 

10 A classical example is the bacterial chloramphenicol acetyl transferase (CAT) gene. 
CAT activity can be measured in cell extracts as conversion of added non-acetylated 
chloramphenicol to the acetylated form of chloramphenicol by chromatography 
(Gorman Mol Cell Biol 2 1044-1051 (1982)). Similar strategies enable the use of the 
firefly luciferase gene as a reporter. In this instance it is the light produced by 

15 bioluminescence of the lucifeiin substrate that is measured. 

Some reporters also benefit from tiie visual detection assays that allow in situ analysis 
of reporter activity. A frequently used example would be ^-galactosidase (Lac Z), 
where the addition of an artificial substrate, X-gal, enables reporter activity to be 

20 detected by the appearance of blue colouration in the sample. As it is accumulative it 
effectively provides an historical record of its induction. This is particularly useful for 
measuring transient responses where a promoter is activated for only a short time 
before being rapidly inactivated. This reporter has been successfully used both in 
cultured cells and in vivo (Campbell et al J. Cell Biol 109 2619-2625 (1996)), though 

25 its suitability for in vivo use has been questioned in some reports (Sanchez-Ramnos et 
al Cell Transplant. 9 657-667 (2000); Montoliu et al Transgenic Res. 9 237-239 
(2000); Cohen-Tannoudji et al Transgenic Res. 9 233-235 (2000)). It has been 
demonstrated fliat Lac Z in combination with fluorescent substrates can enable the 
sorting of cells that express the reporter by use of a fluorescence-activated cell sorter 

30 (FACS) (Fiering et al Cytometry 12 291-301 (1991)). 
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In Other systems, the reporter product itself is direcUy detected, removing the need for 
a substrate. Green fluorescent protein has become on of the most commonly used 
examples of this category of reporter (Hcawa et al Curr. Top. Dev. Biol 44 1-20 
(1997)). This autofluoresdng protein was derived from the bioluminescent jellyfish 
5 Aequoria victoria. Several colour spectral variants of this reporter have been 
developed (Hadjantonakis & Nagy, Histochem. Cett. Biol 115 49-58 (2001)). 

Recently reporter systems based on energy emission systems have been developed. 
These include single photon emission computed tomography (SPECT) and positron 
10 emission tomography (PET) though these require the introduction of a radiolabelled 
isotope probe in to the host cell or animal that is then modified by the target reporter 
gene. For example the PET system measures reporter sequestering of the positron 
emitting probe (Sun et al Gene Ther. 8 1572-1579 (2001)). These are summarised as 
follows: 

15 



Established reporter 


Enzymatic 


Light based 


alkaline phosphatase 


Green fluorescent protein 


Beta galactosidase 


dsRed 


Thymidine kinase 


Luciferase 


Neomycin resistance 




Chloramphenicol acetyl transferase 




Growth honnone 





Many tried and tested reporter systems have been developed but nevertheless share 
certain limitations. Those based on prokaryote genes often suffer poor expression in 
transgenic manmials (Montoliu et al Transgenic Res. 9 237-238 (2000); Cohen- 
20 Tannoudji et al Transgenic Res. 9 233-235 (2000)). Furthermore the presence of 
prokaryote DNA sequences has been implicated in the suppression of expression ftom 
adjacent eukaryote transgenes as have the presence of intronless, cDNA based 
eukaryote gene sequences (Qaik et al., 1997). 
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Most of the current reporters, whilst useful for monitoring expression under certain 
circumstances, have certain limitations. Many accumulate in cells and are not useful 
for monitoring changes in promoter activation over time. Perhaps more importantly 
5 detection of expression necessitates the fixing of cultured cells or the sacrifice of 
transgenic animals, thus limiting reporters to invasive detection strategies. There are a 
few exceptions and these include the use of growth hormone (Bchini et al 
Endocrinology 128 539-546 (1991)), However its high biological activity effectively 
limit its widespread applicability. Another enzyme that has been used in vi\>o is a 

10 secreted version of alkaline phosphatase (SEAP) (Nilsson et al Cancer Chemother, 
Pharmacol 49 93-100 (2002); Durocher NucL Acids. Res. 30 E9 (2002)) though 
again, the potential biological effects resulting from its heterologous expression 
remain untested. GFP has been detected in whole animals and though possessing 
relatively low biological activity its use has so far been limited to neonatal and nude 

15 mice in which both internal tissue and dermal fluorescence are more readily observed. 
In addition there has been a report that GEP is cytotoxic (Liu et al Biochem. Biophys. 
Res. Comm. 260 712-717 (1999)). Although reporter systems based on tomography 
allow monitoring of reporter expression in internal tissues they require addition of 
exogenously added substrates that could potentially confound results by influencing 

20 expression of the reporter. Additionally they can lack the sensitivity required for 
quantitative analysis of reporter expression. 

There is therefore a need for a reporter system that overcomes some or all of these 
limitations. Primarily it should be non-invasive inasmuch as its detection does not 

25 involve addition of an external substrate or sacrifice of transgenic animals. This would 
also ideally stipulate that the reporter be secreted (in vitro and i/i viv^?) or excreted (m 
vivo). Secondly it should be biologically neutral with regard to the test expression 
system so that no phenotypic effects either confound readout fi"om the system or affect 
the health of the transgenic animal. Thirdly a family of reporters sharing similar and 

30 therefore predictable characteristics allowing comparison between reporters is 
required. This may be achieved if members share a common structure or backbone. 
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A system satisfying these reqiniements has now been found. The members of the 
lipocalin protein family fulfil the necessary characteristics for a non-invasive reporter. 

According to a first aspect of the invention, there is provided a nucleic acid construct 
comprising (i) a nucleic acid sequrace encoding a member of the lipocalin protein 
family, and (ii) a nucleic acid sequence raicoding a peptide sequence of from 5 to 250 
amino acid residues 



The lipocalins are a diverse family of small molecule transporter proteins that share a 
common conserved gene structure (Flower et al BiocMm. Biophys Acta 1482 9-24 
(2000)), Members of this family are small in size with the majority falling into the 18- 
25KD range. Some are naturaUy secreted, e.g. ovine betalactoglobulin (BLG) 
(accession No. X12817), or excreted e.g. murine major urinary protein QJBJP) (e.g. 
accession No. NM 031188) and rat 06-2-urinary globulin (a-2u) (accession number 
M27434). lipocaUn reporters will preferably be either MUP, BLG or cfr-2u but could 
be chosen from the foUowing list of other lipocalin family members shown in Table 1: 



Table 1 



Protein 


Subunit 

molecular 

mass 


Pl 


No. 

residues 


Oligomeric 
State 


Glycosyln. 


No. 
S=S 


Abbr./ref 


Kernel IlDOcalins 
















Retinol-binding 
protein 


21.0 


5.5 


183 


Monomer 




3 


RBP (1), 
(2) 


Purpurin 


20.0 




175 








PURP (3) 


Retinoic acid- 
binding orotein 


18.5 


5.2 


166 


Monomer 




1 


RABP(4) 


oe2u-<3lobulin 


18.7 


5.7- 
6.7 


162 


Dimer 




1 


A2U (5)- 
(7) 


Major urinary 
protein 


17.8 


5.5- 
5.7 


161 


DImer 




1 


MUP (8)- 
(10) 


Birin-binding 
protein 


19.6 




173 


Tetramer 




2 


BBP (11) 


a- 

Crustacvanfn 


350.0 


4.3- 
4.7 


174/181 


Octamer of 
heterodimers 




2/2 


(12) (13) 


Pregnancy 
protein 14 


56JD 




162 


Homodimer 


+ 




PP14 (15) 


Lactoglobulin 


18.0 


5.2 


162 




Dimer/ 

monomer " 




2 


Big (16)- 
(18) 



10 



15 
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Protein 


Subunit 
molecular 

mass 


Pl 


No. 

residues 


Oligomeric 
State 


Giycosyln. 


No. 
S=S 


Abbr./ref 


otr 

Microglobulln 


31.0 


4.3- 
4.8 


188 


Monomer 
+complexes 


+ 


1 


A1M(19) 


C8y 


22.0 




182 


Part of complex 




1 


C8y(20) 


Apofipoprotein 

D 


29.0-32.0 


4.7- 
5.2 


169 


Dimer 

+complexes 


+ 


2 


ApoD 
(21)-{23) 


Lazarillo 


45.0 




183 


Monomer 


+ 


+ 


LAZ(24) 


Prostaglandin 
D synthase 


27.0 


4.6 


168 


Monomer 


+ 


1 


PGDS 
(25) 


Quiescence- 
specific protein 


21.0 


6.3 


158 






1 


QSP (26)- 
(28) 


Neutrophil 
lipocalin 


25.0 




179 


Monomer/ 

DImer 

+complexes 






NGAL 
(29)-(32) 


Choroid plexus 
protein 


20.0 




183 


Monomer 


_ 




(33) 


Outlier 
lipocallns 
















Odorant- 
binding protein 


37.0-40.0 


4.7 


159 


DImer 




0 


OBP (34)- 
(36) 


von Ebner's- 
qfand protein 


18.0 


4.8- 
5.2 


170 


Dimer 




1 


VEGP 
(37)-(40) 


tti-Acid 
glycoprotein 


40.0 


3.2 


183 


Monomer 


+ 


2 


AGP 
(41)(42) 


Probasin 


20.0 


11.5 


160 








PBAS (43) 


Aphrodisin 


17.0 




151 




+ 


2 


(44) 



"Giycosyln". = glycosyladon 
' "No. S=S" = no. of disulphides 
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10 (18) Escribano et al Biochem. Biophys. Res. Comm. 155 1424-1429 (1988) 

(19) Haefliger et al Mol. Immunol. 28 123-131 (1991) 

(20) Balbin et al Biochem. J. 271 803-807 (1990) 

(21) Peatsch, M. C. & Boguski, M S., New Biol. 2 197-206 (1990) 

(22) Morais Cabral et al FEBS Lett 366 53-56 (1995) 
15 (23) Ganfomia et al Development 121 123-134 (1995) 

(24) Urade et al J. Biol. Chem. 264 1041-1045 (1988) 

(25) Cancedda et al J. Cell Biol. 107 2455-2463 (1990) 

(26) Cancedda et al Biochem. Biophys. Res. Comm. 168 933-938 (1990) 

(27) Nakano, T. & Graf, T. Oncogene 7 527-534 (1992) 
20 (28) Hraba-Renevey et al Oncogene 4 601-608 (1989) 

(29) Meheus et al J. Immunol 151 1535-1547 (1993) 

(30) Liu, Q., & Nilsen-Hamilton, M. J. Biol Chem. 270 22565-22570 (1995) 

(31) Kasik, J. W. & Rice, E. J. Am. J. Obstet. Gynecol 173 613-617 (1995) 

(32) Achen etalj. Bidl Chem. 267 23170-23174 (1992) 
25 (33) Snyder et al J. Biol Chem. 263 13971-13974 (1988) 

(34) Lecetal Science 235 1053-1056 (1987) 

(35) Cavaggioni et cd FEBS Lett 212 225-228 (1987) 

(36) Kock et al Physiol Behav. 56 1 173-1 177 (1994) 

(37) Schmale et al Ciha Found. Symp. 179 167-185 (1993) 
30 (38) Redl et al J. Biol Chem. 267 20282-20827 (1992) 

(39) Glasgow et al Curr. Eye Res. 14 363-372 (1995) 



DOCIO: <WO___J00401 1676A«.L> 



wo 2004/011676 



PCT/GB2003/003192 



(40) Kremer et al Pharmacol Rev, 40 1-40 (1988) 

(41) Amaud et al Methods Enzymol 163 418-431 (1988) 

(42) Matuo et al Biochem. Biophys. Res. ConvTL 118 467-473 (1984) 

(43) Henzel et al 7. Biol Ghent 263 16682-16687 (1988) 

5 (44) Magert et al Proc. Nafl Acad. Sci. USA 92 2091-2095 (1995) 

The nucleic acid sequences of the present invention also include sequences that are 
homologous or complementary to those referred to above. The p^cent identity of two 
nucleic acid sequences is determined by aligning the sequences for optimal 

10 comparison purposes (e.g., gaps can be introduced in the first sequence for best 
alignment with the sequence) and comparing the amino acid residues or nucleotides at 
corresponding positions. The "best alignment" is an alignment of two sequences 
which results in the highest percent identity. The percent identity is determined by the 
number of identical amino acid residues or nucleotides in the sequences being 

15 compared (z.e., % identity = # of identical positions/total # of positions x 100). 

The determination of percent identity between two sequences can be accomplished 
using a mathematical algorithm known to those of skill in the art. An example of a 
mathematical algorithm for comparing two sequences is the algorithm of Karlin and 

20 Altschul Proc. Natl Acad. Set USA (1990) 87:2264-2268, modified as in Karlin and 
Altschul (1993) Proc. Natl Acad Sci. USA 90:5873-5877. The NBLAST and 
XBLAST programs of Altschul et al, 7. Mol Biol (1990) 215:403-410 have 
incorporated such an algorithm. BLAST nucleotide searches can be performed with 
the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 

25 homologous tp a nucleic acid molecules of the invention. To obtain gapped alignments 
for comparison purposes. Gapped BLAST can be utilised as described in Altschul et 
al. Nucleic Acids Res. (1997) 25:3389-3402. Alternatively, PSI-Blast can be used to 
perform an iterated search which detects distant relationships between molecules (Id). 
When utilising BLAST, Gapped BLAST, and PSI-Blast programs, the default 

30 parameters of the respective programs (e.g., NBLAST) can be used. See 
w ww.ncbi .nlm .nih. gov . 
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Another example of a mathematical algorithm utilised for the comparison of 
sequences is the algorithm of Myers and Miller, CABIOS (1989). The ALIGN 
program (version 2.0) which is part of the GCG sequence alignment software package 
has incorporated such an algorithm. Other algorithms for sequence analysis known in 
the art include ADVANCE and ADAM as described in ToielUs and Robotti Ccnnput. 
Appl. BioscL (1994) 10:3-5; andFASTA described in Pearson and lipman Proc. Natl. 
Acad. Sci. USA (1988) 85:2444-8. Within FASTA. ktup is a control option that sets the 
sensitivity and speed of the search. 

A nucleic acid sequence which is complementary to a nucleic add sequence of the 
present invention is a sequence which hybridises to such a sequence under stringent 
conditions, or a nucleic acid sequence which is homologous to or would hybridise 
under stringent conditions to such a sequence but for the degeneracy of the genetic 
code, or an oligonucleotide sequence specific for any such sequence. The nucleic acid 
sequences include oligonucleotides composed of nucleotides and also fliose composed 
of peptide nucleic adds. Where die nucldc sequence is based on a fragment of the 
sequences of the invention, the fragm«it may be at least any ten consecutive 
nucleotides from tiie gene, or for example an oligonucleotide composed of from 20, 
30, 40, or 50 nucleotides. 

Stringent conditions of hybridisation may be characterised by low salt concentrations 
or high temperature conditions. For example, highly stringent conditions can be 
defined as being hybridisation to DNA bound to a solid support in 0.5M NaHP04, 7% 
sodium dodecyl sulfate (SDS), ImM EDTA at 65°C, and washing in O.lxSSC/ 
0.1%SDS at 68°C (Ausubd et al eds. "Current Protocols in Molecular Biology" 1, 
page 2.10.3, published by Green Publishing Assodates, Inc. and John Wiley & Sons, 
Inc., New York, (1989)). In some circumstances less stringent conditions may be 
required. As used in tiie present application, moderatdy stringent conditions can be 
defined as comprising washing in 0.2xSSao.l%SDS at 42''C (Ausubel et al (1989) 
supra). Hybridisation can also be made more s tringe nt by the addition of increasing 
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amounts of fonnamide to destabilise the hybrid nucleic acid duplex. Thus particular 
hybridisation conditions can readily be manipulated, and will generaUy be selected 
according to the desired results. In general, convenient hybridisation temperatures in 
the presence of 50% formamide are 42^C for a probe which is 95 to 100% homologous 
5 to the target DNA, 37°C for 90 to 95% homology, and 32°C for 70 to 90% homology. 

Examples of preferred nucleic acid sequences for use in according to the various 
aspects of the present invention are the sequences of the invention are disclosed 
herein; Complementary or homologous sequences may be 75%, 80%, 85%, 90%, 
10 95%, 99% sinailar to such sequences. 

With the addition of peptide tags to a chosen lipocalin reporter there is provided a 
useful sub-family of reporter proteins. Essentially it allows generation of a large 
number of reporters from a single lipocalin where that lipocalin acts as the carrier for a 

15 range of peptides that can be clearly differentiated from one another by a range or 
biological or physical assay techniques. For example it has been demonstrated that a 
casein kinase recognition sequence engineered in exon 3 of the ovine 
betalactoglobulin (BLG) gene resulted in expression of a novel form of BLG 
containing an active kinase substrate in one of the surface loops of the protein in 

20 transgenic mice (McClenaghan et al Protein Eng. 12 259-264 (1999)). 

The position of the peptide tag may be at the amino temiinal or carboxy temodnal or 
inserted internally with respect to the amino acid sequence of the reporter. All three 
examples are represented in Figure 1. 

25 

The peptide tag can be a sequence consisting of between 5 to 250 amino acids. 
Suitably, in the ranges of from, 5 to 50, 10 to 60, 20 to 70, 30 to 80, 40 to 90, and so 
on. In some embodiments of the invention peptides may be required to consist of a 
greater number of amino acids than 250 residues. 

30 
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In a preferred embodiment of the invention the peptide tag may be ah epitope, that is a 
defined amino acid sequence from a protein with a fiiUy characterised cognate 
antibody. The skilled person can select such epitopes based on sequences identified as 
possessmg antigenic properties. In certain embodiments of the invention the epitope 
5 tag may be the amino acid sequence below fix)m the c-myc oncogene (Evans et al Mol 
Cell. Biol S 3610-3616 (1985)): 

-Glu-Ghi-Lys-Leu-IIe-Ser-Glu-Glu-Asp-Leu- 
10 (EQKUSEEDL) 



or it may be the amino acid sequence from the simian virus V5 protein (Southern et al 
J. Gen. Virol. 72 1551-1557 (1991)), shown below: 

-Gly-Lys-Pro-ne-Pro-Asn-Pro-Leu-Leu-Gly-Leu-Asp-Ser-Thr- 
(GKPIPNPLLGLDST) 

In certain embodiments of the invention, the epitope may be selected from but not 
limited to the c-myc and V5 proteins. 

Other alternative epitopes may include, but are not limited to: 



Haemagjutinin (YPYDVPDYA) 
25 aonelOO (NVRFSTTVRRRA) 



rablla 
DOB 



30 ARF 



(KQMSDRRENDMSPS) 
(SGNEVSRAVLIPQSC) 
soil (SSLSYTNPAVAATSANL) 
ert>B4 (RSTLQHPDYLQEYST) 
(VSTIXRWERPPGHRQA) 



.R'^ J^aFQQLVQCLTEEEIAALGAYV) 
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WDLPEPl (QEQCQEVWRKRVISAPLKSP) 
HAFIO (RLSDKTGPVAQEKS) 

Preferably the epitope tag is recognised by its cognate antibody irrespective of whether 
S it is located at the amino terminal, carboxy terminal or in an internal domain of the 
reporter protein. 

Li another embodiment of the invention the peptide tag may possess enzymatic 
activity that converts a substrate to a forai that is readily detectable by an assay. For 

10 example a kinase activity specifying phosphorylation of another protein or peptide 
substrate that could be added to the secreted or excreted analyte along with a 
phosphate group donor. Detection could be achieved using an inmiunological assay 
based on detection by an antibody specifically recognising the phosphorylated version 
of the tagged reporter protein. Alternatively the use of phosphate radiolabelled with an 

15 isotope of phosphorous such as ^^P or ^^P. Other enzymic modifications include for 
example acetylation, sulphation and glycosylation. Another possibility is peptide tag 
that is an enzyme, that is the construct comprises a nucleic acid sequence encoding an 
enzyme, or a nucleic acid sequence encoding a catalytic sequence thereof, such as 
Glutathoine-S-transferase (GST) where enzyme activity can be detected by means of 

20 an activity assay or by antibody reactivity. 

Suitably, the nucleic acid sequence encoding the member of the lipocalin protein 
family is contiguous with the nucleic acid sequence encoding the peptide sequence. 
However, a linker nucleic acid sequence may be inserted between these two sequences 
25 that encodes a short number of amino acids. 

The nucleic acid construct may additionally comprise a promoter element upstream of 
the nucleic acid encoding the rnember of the lipocalin protein family. The promoter 
element may be an inducible promoter, preferably a stress inducible promoter. 

30 



2004011 676A2 I > 



10 



wo 2004/011676 PCT/GB2003/003192 

13 

It is also within the scope of the present invention for the nucleic acid construct to 
include more than one detectable peptide label. Such as for example, a peptide antigen 
and an enzyme (or an active catalytic site thereof). One possible combination is the 
peptide epitope c-myc and the enzyme GST. 

Other embodiments of this aspect could include, for example site of interaction with 
protein other than antibody e.g. lectin binding site, or modification of tag by e.g. 
addition of amino acid multimer such as polylysine; or incorporation of a 
fluorochrome. 

The peptide sequence may be as described above but it also extends to peptides and 
polypeptides that are substantially homologous thereto. The term "polypeptide'* 
includes both peptide and protein, unless the context specifies otherwise. 

15 Such peptides include analogues, homologues, orthologues, isoforms, derivatives, 
fusion proteins and proteins with a similar structure or are a related polypeptide as 
herein defined. 

The term "analogue" as used herein refers to a peptide that possesses a similar or 
20 identical function as a peptide coded for by a nucleic acid sequence of the invention 
but need not necessarily comprise an amino acid sequence that is similar or identical to 
an amino acid sequence of the invention, or possess a structure that is similar or 
identical to that of a peptide of the invention. As used herein, an amino acid sequence 
of a peptide is "similar" to that of a peptide of the invisntion if it satisfies at least one of 
25 the following criteria: (a) the peptide has an amino acid sequence that is at least 30% 
(more preferably, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, 
at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at 
least 90%, at least 95% or at least 99%) identical to the amino acid sequence of a 
peptide of the present invention; (b) the peptide is encoded by a nucleotide sequence 
30 that hybridizes under stringent conditions to a nucleotide sequence encoding at least 5 
amino acid residues (more preferably, at least 10 amino acid residues, at least 15 
amino acid residues, at least 20 "amino acid residues, at least 25 amino acid residues, at 
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least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino 
residues, at least 70 amino acid residues, at least 80 amino acid residues, at least 90 
amino acid residues, at least 100 amino acid residues, at least 125 amino acid residues, 
or at least 150 amino acid residues) of a peptide sequence of the invention; or (c) the 
5 peptide is encoded by a nucleotide sequence that is at least 30% (more preferably, at 
least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% 
or at least 99%) identical to the nucleotide sequence encoding a peptide of the 
invention. 

10 

As used herein, a peptide with "similar structure" to that of a peptide of the invention 
refers to a peptide that has a similar secondary, tertiary or quaternary structure as that 
of a peptide of the invention. The stmcture of a peptide can determined by methods 
known to those skilled in the art, including but not limited to. X-ray crystallography, 
15 nuclear magnetic resonance, and crystallographic electron microscopy. 

The term "fusion protein" as used herein refers to a peptide that comprises (i) an 
anoino acid sequence of a peptide of the invention, a fragment thereof, a related 
peptide or a fragment thereof and (ii) an amino acid sequence of a heterologous 
20 peptide (f.e., not a peptide sequence of the present invention). 

The term "homologue" as used herein refers to a peptide that comprises an amino acid 
sequence similar to that of a protein of the invention but does not necessarily possess a 
similar or identical function. 

25 

The term "orthologue" as used herein refers to a peptide that (i) comprises an amino 
acid sequence similar to that of a protein of the invention and (ii) possesses a similar 
or identical function. 

30 The term "related peptide" as used 'herein refers to a homologue, an analogue, an 
isoform of , an orthologue, or any combination thereof of a peptide of the invention. 
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The term "derivative" as used herein refers to a peptide that comprises an amino acid 
sequence of a peptide of the invention which has been altered by the introduction of 
- amino acid residue substitutions, deletions or additions. The derivative peptide 
5 possess a similar or identical function as peptides of the invention. 

The tenn "fragment" as used herein refers to a peptide comprising an amino acid 
sequence of at least 5 amino acid residues (preferably, at least 10 amino acid residues, 
at least 15 amino acid residues, at least 20 amino acid residues, at least 25 amino acid 
10 residues, at least 40 amino acid residues, at least 50 amino acid residues, at least 60 
amino residues, at least 70 amino acid residues, at least 80 amino acid residues, at least 
90 amino acid residues, at least 100 amino acid residues) of the amino acid sequence 
of a peptide of the invention. 



15 



20 



The term "isoform" as used herein refers to variants of a peptide that are encoded by 
the same gene, but that differ in their isoelectric point (pi) or molecular weight (MW), 
or both. Such isoforms can differ in their amino add composition ie.g. as a result of 
alternative spKcing or limited proteolysis) and in addition, or in the alternative, may 
arise from differential post-translational modification (e.g., glycosylation, acylation, 
phosphorylation). As used h^dn, the term "isofonn" also refers to a protein that 
peptide exists in only a single form. i.e., it is not expressed as several variants. 



The percent identity of two amino add sequences or of two nudeic add sequences is 
deteraiined by aligning the sequences for optimal comparison purposes (e.g., gaps can 

25 be introduced in the first sequence for best alignment with the sequence) and 
comparing the amino add residues or nucleotides at corresponding positions. Hie 
"best alignment" is an alignment of two sequences which results in the highest percent 
identity. The percent identity is determined by the number of identical amino add 
residues or nudeotides in the sequences being compared (i.e., % identity = # of 

30 identical positions/total # of positions x 100). 
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The deteimination of percent identity between two sequences can be accomplished 
using a mathematical algorithm known to those of skill in the art. An example of a 
mathematical algorithm for coiBparing two sequeiices is the algorithm of Karlin and 
Altschul Proc. Natl Acad. Set USA (1990) 87:2264-2268, modified as in Karlin and 
5 Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. The NBLAST and 
XBLAST programs of Altschul et al, /• Mol Biol (1990) 215:403-410 have 
incorporated such an algorithm. BLAST nucleotide searches can be performed with 
the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to a nucleic add molecules of the invention. BLAST protein searches 

10 can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain 
amino acid sequences homologous to a protein molecules of the invention. To obtain 
gapped alignments for comparison puiposes. Gapped BLAST can be utilised as 
described in Altschul et al. Nucleic Acids Res. (1997) 25:3389-3402. Alternatively, 
PSI-Blast can be used to perform an iterated search which detects distant relationships 

15 between molecules (Id.). When utilising BLAST, Gapped BLAST, and PSI-Blast 
programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. 

Another example of a mathematical algorithm utilised for the comparison of 
20 sequences is the algorithm of Myers and Miller, CABIOS (1989). The ALIGN 
program (version 2.0) which is part of the GCG sequence alignment software package 
has incorporated such an algorithm. Other algorithms for sequence analysis known in 
the art include ADVANCE and ADAM as described in Torellis and Robotti Comput 
AppL Bioscl (1994) 10:3-5; and FASTA described in Pearson and lipman Proc, Natl 
25 Acad. Scu USA (1988) 85:2444-8. Within FASTA, ktup is a control option that sets the 
sensitivity and speed of the search. 

The skilled person is aware that various amino acids have similar properties. One or more 
such amino adds of a substance can often be substituted by one or more other such 
30 amino acids without eliminating a desired activity of that substance. Thxis the amino 
acids glycine, alanine, valine, leucine and isoleucine can often be substituted for one 
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another (amino acids having aliphatic side chains). Of these possible substitutions it is 
preferred that glycine and alanine are used to substitute for one another (since they have 
relatively short side chains) and that valine, leucine and isoleudne are used to substitute 
for one another (since they have larger aliphatic side chains which are hydrophobic). 
5 Other amino acids which can often be substituted for one another include: phenylalanine, 
tyrosine and tryptophan (amino acids having aromatic side chains); lysme, argmine and 
histidine (amino adds having basic side chains); aspartate and glutamate (amino adds 
havmg addic side chains); asparagine and glutamine (amino adds having amide side 
chains); and cystdne and methionine (amino adds having sulphur containing side 
10 chains). Substitutions of this nature are often referred to as "conservative" or "semi- 
conservative" amino add substitutions. 

Amino add deletions or insertions may also be made relative to the amino add sequence 
of a peptide sequence of the invention. Thus, for example, amino adds which do not 
have a substantial effect on tiie biological activity or immunogenidty of such peptides, or 
at least which do not eliminate such activity, may be deleted. Amino add insertions 
relative to tiie sequence of peptides of the invention can also be made . This may be done 
to alter the properties of a peptide of tiie present invention (e.g. to assist in identification, 
purification or expression. Such amino add changes relative to the sequence of a 
polypeptide of flie invention fipom a recombinant source can be made using any suitable 
technique e.g. by using ate-directed mutagenesis. 

According to the various embodiments of this aspect of the invention, tiie promoter 
will preferably be of mammalian origin, but also may be from a non-mammalian 
animal, plant, yeast or bacteria. The promoter may be selected from but is not limited 
to promoter elements of the following inducible genes: 

whose expression is modified in response to disturbances in tiie homeostatic 
state of DNA in tiie cell. These distiarbances may include chemical alteration of 
nucleic adds or precursor nucleotides, inhibition of DNA syntiiesis and 
inhibition of DNA replication The sequence can be selected from but not 



20 



25 



30 
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limited to the group consisting of c-myc (Hoffman et al Oncogene 21 3414- 
3421), p21AVAF-l (El-Diery Curr. Top. Microbiol. Immunol. 227 121-137 
(1998); El-Diery Cell Death Differ. 8 1066-1075 (2001); Dotto Biochim. 
Biophys. Acta 1471 43-56 (2000)), MDM2 (Alarcon- Vargas & Ronai 
5 Carcinogenesis 23 541-547 (2002); Deb & Front Bioscience 7 235-243 

(2002)), Gadd45 (Sheikh al Bio'chem. Pharmacol. 59 43-45 (2000)), FasL 
(Wajant Science 296 1635-1636 (2002)), GAHSP40 (Hamajima et al J. Cell 
Biol. 84 401-407 (2002)), TRAIL-R2/DR5 (Wu et al Adv.Exp. Med. Biol. 465 
143-151 (2000); El-Diery Cell Death Differ. 8 1066-1075 (2001)), BTG2/PC3 
10 (Tirone et alJ. Cell. Physiol. 187 155-165 (2001)); 

whose transcription is modified in response to oxidative stress. The sequence 
can be selected irom but not limited to the group consisting of MnSOD and/or 
CuZnSOD (HalUwell Free Radic. Res. 31 261-272 (1999); Gutteridge & 

15 Halliwell Ann. NY Acad. ScL 899 136-147 (2000)), IkB (Ghosh & Karin Cell 

109 Suppl.., S81-96 (2002)), ATF4 (Hai & Hartman Gene 273 1-11 (2001)), 
xanthine oxidase (Pristos Chem. Biol Interact. 129 195-208 (2000)), COX2 
(EBnz & Brune J. Pharmacol Exp. Ther. 300 376-375 (2002) ), iNOS 
(Alderton et al Biochem. J. 357 593-615 (2001)), Ets-2 (Bartel et al Oncogene 

20 19 6443-6454 (2000)), FasiyCD95L (Wajant Science 296 1635-1636 (2002)), 

•yOCS (Lu Curr. Top. Cell Regul 36 95-116 (2000); Soltaninassab et al J. 
Cell Physiol 182 163-170 (2000)), ORP150 (Ozawa et al Cancer Res. 61 
4206-4213 (2001); Ozawa etalJ. Biol Chem. 274 6397-6404 (1999)). 

25 whose expression is modified in response to hepatotoxic stress. The sequence 

can be selected from but not limited to the group consisting of Lrg-21 
(Drysdale et al Mol Immunol 33 989-998 (1996)), SOCS-2 and/or SOCS-3 
(ToDet-Egnell et al Endocrinol 140 3693-3704 (1999), PAI-1 (Fink et al Cell 
Physiol Biochem. 11 105-114 (2001)), GBP28/adiponectin (Yoda-Murakami 

30 et al Biochem. Biophys. Res. Commun. 285 372-377 (2001)), a-1 acid 

glycoprotein (Komori et al Biochem Pliarmacol 62 1391-1397 (2001)), 
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metallothioneine I (Palmiter et al Mol. Cell. Biol. 13 5266-5275 (1993)), 
metallothioneine H (ScWager & UssiApp. Toxicol 20 395-405 (2000)), ATF3 
(Hai & Haitman Gene 273 1-11 (2001)), IGFbp-3 (Popovici et id J. Clin. 
Endocrinol. Metab. 86 2653-2639 (2001)), VDGF (Ido et al Cancer Res. 61 
3016-3021 (2001)) and HEFla CTacchini et al Biochem. Pharmacol 63 139- 
148 (2002)). 

whose expression is modified in response to a pro-apoptotic stimulus. The 
sequence can be selected from but not limited to the group consisting of Gadd 
34 (Hollander et al J. Biol Chetn. 272 13731-13737 (1997)), GAHSP40 
(Hamajima et al J. Cell Biol 84 401-407 (2002)), TRAIL-R2/DR5 (Wu et al 
Adv.Exp. Med. Biol 465 143-151 (2000); H-Diery Cett Death D^er. 8 1066- 
1075 (2001)), c-fos (Teng Int. Rev. Cytol 197 137-202 (2000)), 
CHOP/Gaddl53 (Talukder et al Oncogene 21 4280-4300 (2002)), APAF-1 
(Cecconi & Gruss Cell Mol Life Sci. S 1688-1698 (2001)), Gadd45 (Sheikh et 
al Biochem. Pharmacol 59 43-45 (2000), BTG2/PC3 CKione /. Cell Physiol 
187 155-165 (2001)), Peg3/Pwl (Relaix et al Proc. Nat'l Acad Sci. USA 97 
2105-2110 (2000)). Siah la (Maeda et al FEBS Lett. 512 223-226 (2002)), S29 
ribosomal protdn (Khaxma et al Biochem. Biophys. Res. Commun. 277 476- 
486 (2000)), FasiyCD95L (Wajant Science 296 1635-1636 (2002)), tissue 
tranglutaminase (Chen & Mehta Ijtt. J. Cell Biol 31 817-836 (1999)), GRP78 
(Rao et al FEBS Lett. 514 122-128 (2002)), Nur77/NGFI-B (Winoto Int. Arcli. 
Allergy Immunol 105 344-346 (1994)), CyclophilinD (Andreeva et al Int. J. 
Exp. Pathol 80 305-315 (1999)), p73 (Yang et al Trends Genet. 18 90-95 
(2002)) and Bak (Lutz BiocJiem. Soc. Trans. 28 51-56 (2000)). 

whose expression is modified in response to the administration of chemicals or 
drugs. The sequence can be selected fiiom but not limited to the list comprised 
of xraiobiotic metaboh'sing cytochrome p450 enzymes from the 2A, 2B, 2C, 
2D, 2E, 2S, 3 A, 4A and 4B gene families (Smith et al Xenobiotica 28 1129- 
1165 (1998); HonkasM & Negishi J. Bipchern. Mol Toxicol 12 3-9 (1998); 
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Raucy et al J, Phaniiacol Exp. Ther. 302 475-482 (2002); Quattrochi & 
Guzelian Drug Metab. Dispos. 29 615-622 (2001)). 



The promoter element may also be a synthetic promoter sequence comprised of a 
5 minimal eukaryote consensus promoter operatively linked to one or more sequence 
elements known to confer transcriptional inducibility in response to specific stimulus. 
A minimal eukaryotic consensus promoter is one that will direct transcription by 
eukaryotic polymerases only if associated with functional promoter elements or 
transcription factor binding sites. An example of which is the PhCMV*-l (Furth et al 

10 Proc. Natl Acad. Scu USA 91 9302-9306 (1994)). Sequence elements known to 
confer transcriptional induction in response to specific stimulus include promoter 
elements (Montoliu et al Proc. Nat'l Acad Set USA 92 4244-4248 (1995)) or 
transcription factor binding sites; these will be chosen from but are not limited to the 
list comprising the aryl hydrocarbon (Ah)/Ah nuclear translocator (ARNT) receptor 

15 response element^ the antioxidant response element (ARE), the xenobiotic response 
element (XRE). 

A nucleic acid constract according to the invention may suitably be inserted into a 
vector which is an expression vector that contains nucleic add sequences as defined 
20 above. The terai "vector" or "expression vector'! generally refers to any nucleic acid 
vector which may be RNA, DNA or cDNA. 

The term "expression vector" may include, among others, chromosomal, episomal, 
and virus-derived vectors, for example, vectors derived from bacterial plasmids, from 

25 bacteriophage, from transposons, from yeast episomes, from insertion elements, firom 
yeast chromosomal elements, from virases such as baculoviruses, papova virases, such 
as SV40, vaccinia virases, adenoviruses, fowl pox virases, pseudorabies viruses and 
retrovirases, and vectors derived firom combinations thereof, such as those derived 
from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. 

30 Generally, any vector suitable to maintain, propagate or express nucleic acid to 
express a polypeptide in a host may be used for expression in this regard. 



NSbOClD: <WO__ 2004011 676A2 I > 



10 



wo 2004/011676 

PCT/GB2003/003192 

21 



Recombinant expression vectors wiU include, for example, origins of repKcation, a 
promoter preferably derived from a highly expressed gene to direct transcription of a 
structural sequence as defined above, and a selectable marker to pennit isolation of 
vector containing cells after exposure to the vector. 

Expression vectors may comprise an origin of replication, a suitable promoter as 
defined above and/or enhancer, and also any necessary ribosome binding sites, 
polyadenylation regions, splice donor and acceptor sites, transcriptional termination 
sequences, and 5'- flanking non-transcribed sequences that are necessary for 
expression. Preferred expression vectors according to the present invention may be 
devoid of enhancer elements. 

The expression vectors may also include selectable markers, such as antibiotic 
15 resistance, which enable the vectors to be propagated. 

According to a second aspect of the invention there is provided a nucleic add 
construct comprising a stress inducible promoter operatively isolated from a nucleic 
add sequence encoding a member of the lipocaKn protein family by a nucleotide 
20 sequence flanked by nudeic add sequences recognised by a site specific recombinase, 
or by insertion such that it is inverted with respect to the transcription unit encoding a 
member of the lipocaUn protdn family. The recombinase recognition sites are 
arranged in such a way that the isolator sequence is deleted or the inverted promoter's 
orientation is reversed in the presence of the recombinase. The construct also 
comprises a nucleic add sequence comprising a tissue specific promoter operativdy 
Unked to a gene encoding the coding sequence for the site spedfic recombinase. 

Stress inducible promoters may be as described in relation to the first aspect of the 
invention. 

30 



25 
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This aspect allows for detecting reporter transgene induction in specified tissues only. 
By controlling the appropriate recombinase expression using a tissue specific 
promoter, the inducible transgene will only be viable in those tissues in which the 
promoter is active. For example, by driving recombinase activity from a liver specific 
promoter, only the liver will contain re-arranged reporter construct, and hence will the 
only tissue in which reporter induction can occur. 

Tissue specific promoters are a class of gene promoters whose function is restricted 
solely (or more usually, maily) to a particular cell type or tissue. 

Examples include promoters from the liver, pancreas, mammary gland, squamous 
epithelium, small intestine, skeletal muscle, smooth muscle, striated muscle, heart, 
prostate, adipose tissue, neural crest, brain, kidney and lung. Particular instances of 
tissue specific promoters are as follows (although, the invention is not limited as 
such): 



Tissue 


Example of tissue specific promoter 


liver 


Albumin (Pinkert et al Genes Dev 1987 1: 268-276) 


Liver 


a-fetoprotein (Wen etalDNA Cell Biol 1991 7: 525- 
536) 


Liver 


al-antitrypsin (Shen etalDNA 1989 8 (2): 101-8) 


Pancreas 


Insulin 11 ((a) Gannon et al Genesis 2000 26(2): 139- 
42); (b) Ray et al Int J Pancreatol 1999 25 (3): 157-63) 


Pancreas 


Pdx-1 (Gerrish et al J Biol Chem 2000 275 (5):3485- 
92) 


Mammary gland 


P-Lactoglobulin ((a) Selbert et al Transgenic Res 1998 
7 (5):387-96); (b) Webster et al Cell Mol Biol Res 1995 
41(l):ll-5) 


Mammary gland 


Whey acid protein (Wagner et al Nucleic Acids Res 
1997 25 (21):4323-30) 



wo 2004/011676 



23 



PCT/GB2003/003192 



Tissue 


Example of tissue specific promoter 


SquamoTis epithelium 


Keratin 5 (Brown et al Curr Biol 1998 8 (9):5l6-24) 


Squamous epithelium 


Keratin 14 (Vassar et al Proc Natl Acad SciUSA 
1989 86(5):1563-7) 


Squamous epithelium 


Loricrin (DiSepib et al Differentiation. 1999 64 
(4):225-35) 


Small intestine 


Fatty acid binding protein (Sweetser et al Proc Natl 
AcadSciUSA. 1988 85 (24):9611-5) 


Small intestine 


sucrase-isomaltase (Markowitz et alAm J Physiol 1995 
269 (6 Pt l):G925-39) 


Skeletal muscle 


Myosm hght chain If (Bothe et al Genesis 2000 26 
(2): 165-6) 


Smooth muscle 


SmMHC (Xin etal Physiol Genomics 2002 10 (3):211- 

5) 


Striated muscle 


a-skeletal actin (Miniou et al Nucleic Acids Res 1999 
27 (19):e27) 


Heart 


a-myosin heavy chain (Heger CircRes. 2002 90 
(l):93-9) 


Prostate 


Probasin (Greenberg et al Mol Endocrinol 1994 8:230- 
239) 


Adipose tissue 


aP2 (Gnudi et al Am J Physiol 1996 270 (4 Pt 2):R785- 
92) 


Neural crest 


Pax3 (Goulding et al EMBO J 1991 10 (5):1135-47) 


Neural crest 


Protein 0 (Y amauchi et cd Dev Biol 1999 212 (l):i91- 


Brain 


CaMKn (Tomioka et al Brain Res Mol Brain Res 2002 
108 (l-2):18-32) 


Lung 


surfactant protein C (Koifhagen etalJ Clin Invest 
1994 93(4):1691-9) 
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The recombination event producing- an active reporter transcription unit may therefore 
only take place in tissues where the recombinase is expressed In this way the reporter 
may only be expressed in specified tissue types where expression of the recombinase 
results in a functional transcription unit comprised of die inducible promoter linked to 
5 the promoter. Site specific recombinase systems know to perform such a function 
include the bacteriophage PI cre-lox and the bacterial FLIP systems. The site specific 
recombinase sequences may therefore be two ZaxP sites of bacteriophage PI 

. The use of site specific recombination systems to generate precisely defined deletions in 
10 cultured mammalian cells has been demonstrated Gu et al. {Cell 73 1155-1164 (1993)) 
describe how a deletion in the immunoglobulin switch region in mouse ES ceDs was 
generated between two copies of the bacteriophage PI loxP site by transient expression 
of the Cre site-specific recombinase, leaving a single loxP site. Similarly, yeast FLP 
recombinase has been used to precisely delete a selectable marker defined by 
15 recombinase target sites in mouse erythroleukemia cells (Fiering et aL, Proc. Natl Acad. 
Set USA 90 8469-8473 (1993)). The Cre lox system is exemplified below, but other site- 
specific recombinase systems could be used 

A construct used in the Cre lox system will usually have the following three functional 
20 elements: 

1 - The expression cassette; 

2. A negative selectable marker (e.g. Herpes simplex virus thymidine kinase 
25 (TK) gene) expressed under the control of a ubiquitously expressed promoter 

(e.g. phosphoglycerate kinase (Soriano et at, CeU 64 693-702 (1991)); and 

3. Two copies of the bacteriophage PI site specific recombination site loxP 
(Baubonis et dL, Nuc. Acids. Res. 21 2025-2029 (1993)) located at either end of 

30 theDNAfiragment. 
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This construct can be eliminated from host cells or cell lines containing it by means of 
site specific recombination between the two loxP sites mediated by Cre recombinase" 
protein which can be introduced into the ceUs by lipofection (Baubonis et al., Nuc. Acids 
Res. 21 2025-2029 (1993)). Cells which have deleted DNA between the two loxP sites 
are selected for loss of the TK gene (or other negative selectable maiker) by growth in 
medium containing the j^propriate drag (ganciclovir in the case of TK). 

According to the third aspect of the invention there is provided a host cell transfected 
with a nucleic acid construct according to any one of the previous aspects of the 
invention. ITie cell type is preferably of human or non-human mammalian origin but 
may also be of other animal, plant, yeast or bacterial origin. For example, HEPAl-6, 
mouse hepatoma epithelial cells; HEK293. human embryonic kidney epitheKal cells;' 
COS-1. AfiScan green monkey fibroblasts; CHO. Chinese hamster ovary epitheUal 
ceDs; HT 29. human colon adenocarcinoma epithelial cells; MCF7, human breast 
adenocarcinoma epithelial-like cells; HeLa. human cervical carcinoma epithehal cells. 
HEP G2. human hepatocyte carcinoma epithelial cells; PC3. human prostate 
adenocarcinoma epithelial cells; A2780. hmnan ovarian carcinoma epitheKal ceHs. 

Introduction of an expression vector into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection. micix)injection. catiomc 
lipid-mediated transfection. electroporation, transduction, scrape loading, ballistic 
introduction, infection of other methods. Such methods are described in many 
standard laboratory manuals, such as Samhrook et al.. Molecular Qoning: A 
Laboratory Manual, 2^ Ed.. Cold Spring Harbor Laboratory Press. Cold Spring 
Harbor, N.Y. (1989). 

According to the fourth aspect of the invention, there is proAdded a transgenic non- 
human animal in which the ceUs of the non-human animal express the protein encoded 
by the nucleic acid construct according to any one of the previous aspects of the 
invention. Suitably, the non-human animal is a non-human mammal. Tho transgenic 
animal is preferably a mouse but may be another mammalian species, for ex^ple 
another rodent, e.g. a rat or a guinea pig. or another species such as rabbit, or a canine 
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or feline, or an ungulate species such as ovine, porcine, equine, caprine, bovine, or a 
non-mammalian animal species, e.g. an avian (such as poultry, e.g. chicken or turkey). 

In embodiments of the invention relating to the preparation of a transfected host cell or 
5 a transgenic non-human animal comprising the use of a nucleic acid construct as 
previously described, the cell or non-human animal may be subjected to further 
transgenesis, in v^hich the transgenesis is the introduction of an additional gene or 
genes or protein-encoding nucleic acid sequence or sequences. The transgenesis may 
be transient or stable transfection of a cell or a cell line, an episomal expression system 
10 in a cell or a cell line, or preparation of a transgenic non-human animal by pronuclear 
microinjection, through recombination events in embryonic stem (ES) cells or by 
transfection of a cell whose nucleus is to be used as a donor nucleus in a nuclear 
transfer cloning procedure. 

15 Methods of preparing a transgenic cell or cell line, or a transgenic non human animal, 
in which the method comprises transient or stable transfection of a cell or a cell line, 
expression of an episomal expression system in a cell or cell line, or pronuclear 
microinjection, recombination events in ES cells, or other cell line or by transfection 
of a cell line which may be differentiated down different developmental pathways and 

20 whose nucleus is to be used as the donor for nuclear transfer; wherein expression of an 
additional nucleic acid sequence or constract is used to screen for transfection or 
transgenesis in accordance with the first, second, third, or fourth aspects of the 
invention. Examples include use of selectable markers conferring resistance to 
antibiotics added to the growth medium of cells, e.g. neomycin resistance marker 

25 conferring resistance to G418. Further examples involve detection using nucleic acid 
sequences that are of complementary sequence and which will hybridise with, or a 
component of, the nucleic acid sequence in accordance with the first, second, third, or 
fourth aspects of the invention. Examples would include Southern blot analysis, 
northCTfi blot analysis and PGR. 

30 
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According to the fifth aspect of the invention, there is provided the use of a nucleic 
acid construct in accordance with any one of the first, second, third, or fourth aspects 
of the invention for the detection of a gene activation event resulting ftom a change in 
altered metabolic status in a cell in vitro or in vivo. 

The gene activation event may be the icsult of induction of toxicological stress, 
metabolic changes, or disease that may be. but is not limited to. the result of viral.' 
bacterial, fungal or parasitic infection. 

According to the sixth aspect of the invention there is provided the use of a nucleic 
acid construct comprising a nucleic acid sequence encoding a member of the lipocalin 
protein family, wherein said lipocalin protein is heterologous to the ceU in which it is 
expressed, for the detection of a gene activation event resulting ftom a change in 
altered metabolic status in a cell in vitro or in vivo. 

The gene, activation event may be the result of induction of toxicological stress, 
metabolic changes, disease that may be. but is not limited to. the result of viral,' 
bactaial, fungal or parasitic infection. 

Uses in accordance with the fifth and sixth aspects of the invention also extend to the 
detection of disease states or characterisation of disease models in a cell, cell line or 
non human ti^sgenic animal where a change in the gene expression profile within a 
target ceU or tissue type is altered as a consequence of the disease. Diseases in the 
context of tiiis aspect of the invention which are detectable under the methods 
disclosed may be defined as infectious disease, cancer, inflammatory disease, 
cardiovascular disease, metabolic disease, neurological disease and disease with a 
genetic basis. 

An additional use in accordance with this aspect of the invention involves the growtii 
of a transfected ceU line in accordance with the third aspect in- a suitable 
immunocomprornised mouse sti^n (referred to as a xenograft), for example, the nude 
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mouse, wherein an alteration in the expression of the reporter described in the first or 
second aspects of the invention may be used as a measure of altered metabolic status 
of the host as a result of toxicological stress, metabolic changes, disease with a genetic 
basis or disease that may be, but is not limited to, the result of viral, bacterial, fungal 
5 or parasitic infection. The scope of this use may also be of use in monitoring the 
effects of exogenous chemicals or drugs on the expression of the reporter construct. 

The fifth and sixth aspects of the invention extend to methods of detecting a gene 
activation event i?i vitro or in vivo. 

10 

In an embodiment according to the fifth aspect of the invention, the method comprises 
assaying a host cell stably transfected with a nucleic acid constract in accordance with 
any one of the first or second aspects of the invention, or a transgenic non-human 
animal according to the fourth aspect of the invention, in which the cell or animal is 
15 subjected to a gene activation event that is signalled by expression of a peptide tagged 
lipocalin reporter gene. 

In an embodiment according to the sixth aspect of the invention, the method comprises 
assaying a host cell stably transfected with a nucleic acid construct comprising a 
20 nucleic acid sequence encoding a member of the lipocalin protein family, wherein said 
lipocalin protein is heterologous to the cell in which it is expressed, or a transgenic 
non-human animal whose cells express such a constract, in which the cell or animal is 
subjected to a gene activation event that is signalled by expression of a peptide tagged 
lipocalin reporter gene. 

25 

Accordingly there is provided a method of screening for, or monitoring of 
toxicologically induced stress in a cell or a cell line or a non-human animal, 
comprising the use of a cell, cell line or non hmnan animal which has been transfected 
with or carries a nucleic acid construct as described above. 

30 
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Toxicological stress may be defined as DNA damage, oxidative stress, post 
translational chemical modification of cellular proteins, chemical modification of 
cellular nucleic acids, apoptosis. cell cycle arrest, hyperplasia, immunological 
changes, effects consequent to changes in hormone levels or chemical modification of 
hormones, or other factors which could lead to cell damage. 

Accordingly, there is also provided a method for screening and characterising viral, 
bacterial, fungal, and parasitic infection comprising tiie use of a cell, cell line or non 
human animal which has been ti^sfected with or carries a nucleic acid construct as 
described above. 

Accordingly, there is additionally provided a method for screening for cancer, 
inflammatory disease, cardiovascular disease, metabolic disease, neurological disease 
and disease with a genetic basis comprising tiie use of a ceU, cell Kne or non human 
animal which has been transfected with or carries a nucleic acid constiiict as described 
above. 

Jn these contexts tiie ceU may be transientiy tiransfected. maintaining flie nucleic acid 
constiruct as described above episomaUy and temporarily. Alternatively ceUs are stably 
transfected whereby the nucleic acid construct is permanently and stably integrated 
into the tiiansfected cells' chromosomal DNA. 

Also in fliis context transgenic animal is defined as a non human ti-ansgenic animal 
witii flie nucleic acid construct as defined above preferably integrated into its genomic 
DNA in all or some of its cells. 

Expression of the peptide tagged lipocalin protein in respect of the fiftii aspect of the 
invention can be assayed for by measuring levels of the lipocalin protein in cell cultiire 
medium or purified or partiaDy purified fractions tiiereof . 
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Lipocalins are known to be secreted into body fluids and some are known to be 
eliminated in urine. Expression of the peptide tagged lipocalin protein in accordance 
with the fourth aspect of the invention therefore can be assayed for by measuring 
levels of lipocalin secreted into harvestable body fluids. In a preferred embodiment of 
5 the invention the body fluid will be urine, but may also be selected from the list 
including milk, saliva, tears, semen, blood and cerebrospinal fluid, or purified or 
partially purified fractions thereof. 

Detection and quantification of the tagged lipocalins secreted from cultured cells into 
10 tissue culture medium or transgenic non-human animal body fluid may be achieved 
using a number of methods known to those skilled in the art: 

1. Immunological methods. 

(i) The assay may be an ELISA whereby an antibody or antiserum containing a single 
15 or noixture of antibodies recognising either the lipocalin reporter itself or the peptide 
tag attached to and is used as a capture antibody to coat a microtitre plate or other 
medium suitable for conducting the assay. The culture medium or body fluid 
containing the reporter gene product (analyte) is added to the microtitre plate to allow 
binding of the analyte. Addition of the same antibody or antiserum that has been 
20 conjugated to an enzyme, commonly horseradish peroxidase, is used as a second 
antibody. Addition of a suitable substrate, preferably one producing a colour product 
following conversion by the enzyme is used to quantify the analyte in proportion to 
how much second antibody conjugate has been bound. 

25 (ii) Competitive ELISA. In an alternative form the tissue culture medium or the body 
fluid (analyte) sample containing the tagged lipocalin is bound to a support suitable for 
conducting the assay. In a separate reaction a limited standard amount of antibody 
specifically recognising the reporter gene product is added to a separate aliquot of the 
same and allowed to bind. This is added to the analyte bound to the support to allow 

30 remaining free antibody to bind. A second, enzyme conjugated antibody against for 
example the Fc region of the first antibody is allowed to bind and the colorimetric 
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readout can be used to quantify the analyte whereby the degree of colour change is 
invCTsely proportional to the level of analyte in the sample. 

(iii) Western blot analysis 

Transfected cell homogenates were prepared by incubation of cells in homogenization 
buffer (140mM NaCl, 50mM Tris-HQ pH7.5, ImM EDTA, 1% Tiiton-100) for 30 
minutes on ice. Following a brief centrifugation to lemove insoluble material the 
cleared supematants were assayed for protein content. A volume equivalent to 40ng 
cell extract and an equal volume of cell medium were subjected to SDS-PAGE and 
blotted onto nitrocellulose (Schleicher and SchueU, Dassel, Germany) membrane 
using a semi-dry blotting apparatus (Bio-Rad, Richmond, CA). The membranes were 
blocked for 1 hour in blocking buffer (5% NEDM w/v in PBS) then incubated with 
myc mAb (Invitrogen life Technologies. Carlsbad, CA) diluted in blocking buffer for 
2 hours with continues agitation. After a series of washes in PBST (PBS plus 0.05% 
Tween-20), the membrane was incubated in an anti-mouse antibody conjugated to 
HRP diluted in blocking buffer for one hour with agitation, and after another series of 
washes in PBST the HRP activity was developed using an ECL kit (Pierce, Rockford, 
US) and captured on autoradiographic film (Kodak). 

(iv) Fluorescence polarisation. The antibody specificaUy recognising the reporter 
lipocalin protein is conjugated with fluorescein and mixed with the analyte produced. 
This method quantifies the analyte by direct measurement of the amount of antibody- 
antigen complex present. This method may also be adapted to measure any protein- 
protein interaction. 



2. Release of a labelled substrate. E.g. radioactive (CAT) or fluorometric. colorimetric. 

Detection of conversion of substrate due to enzymatic activity of the lipocalin reporter 
protein produced. The nature of substrate conversion may or may not fall into one or 
more of the following event categories: Proteolysis, phosphorylation, acetylation, 
sulphation, methylation 
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3. Detection of multiple substrates. Where a multiple of lipocalin reporter proteins are 
used methods suitable for detection of such events could include but not necessarily be 
limited to: 

5 

(i) Mass spectrometry 

(ii) Nuclear magnetic resonance (NMR) 

10 In a preferred embodiment of the invention there is provided a method of detecting a 
reporter gene activation event, comprising the steps of: 

1. Transfecting a cell or xnicroinjecting the pronucleus of a fertilised mouse egg 
with a nucleic acid sequence encoding a lipocalin protein tagged with a peptide or 

15 protein as described above in accordance with the first, second, third, or fourth 

aspects of the invention. Optionally use the microinjected egg or transfected mouse 
ES cell line; 

2. Exposing the transfected cell, cell line or transgenic non human animal to a 
20 stimulus which may or may not cause a change in metabolic status resulting 

alteration in gene expression; and. 

3. Using a suitable assay to determine the level expression of the tagged lipocalin 
reporter, for example using detection methods such as ELISA, RIA, Mass 

25 spectrometry, NMR, telemetric methods. 

In step (1), the detectable lipocalin protein may be a heterologous protein to the cell in 
which the nucleic acid constract is expressed. Such an *\mtagged" lipocalin reporter 
protein may not therefore need a peptide or protein tag for detection. 

30 
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Methods and uses in accordance with the present invention offer significant advances 
in investigating any area in which modified gene expression plays a significant role. 
Such peptide tagged lipocalin genes will be of use in cells and transgenic animals to 
detect activity of selected genes. Specific applications include but are not restricted to: 

Providing a rapid and robust in vivo screening system for assessing the 
potential toxic effects of chemicals. 

Provide information on the mechanism of toxicity. Such information 
could be used to eliminate compounds from a selection process or 
suggest possible modifications to a compound. 
Provide information on the effect of combinations of compounds. 
Allow monitoring of variation in reporter gene expression over time by 
measuring levels of reporter(s) in urine at different time intervals. 
Assessment of changes in gene expression associated with pathogenic 
infection. 

Assessment of changes in gene expression associated with 
neurological, cardiovascular and metabolic diseases. 
Assessment of changes in gene expression associated with cancer. 
Provide information allowing validation of drug target selection e.g. by 
matching reporter expression profile to actions of toxins whose 
mechanism is defined and understood. 

Use for evaluating compounds as therapeutic strategies aimed at 
reversing a toxic, metabolic, or degenerative phenotype. 
Assessment of changes in gene expression resulting from 
environmental and/or behavioural changes. 

Preferred features for the second and subsequent aspects of the invention are as for the 
first aspect mutatis mutandis. 

The present invention will now be described with reference to the following examples 
which are present for the purposes of illustration only and should no be construed as 
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being limited with respect to the invention. Reference in the application is also made 
to a number of drawings in which: 

FIGURE 1 shows the position of the peptide tag at the amino terminal or 
carboxy terminal or inserted internally with respect to the amino acid sequence 
of the lipocalin reporter protein 

FIGURE 2 shows the plasmid map for pal ATBLG 

FIGURE 3 shows the plasmid map for pXCS'MycMUP 

FIGURE 4 shows the plasmid map for pcDNA.S'mycMUP 

FIGURE 5 shows the plasmid map for pX4T3'MYCMUP 

FIGURE 6 shows the results of expression of Myc tagged MUP 

FIGURE 7 shows the DNA and amino acid sequences of the MUP clone 
Mmup9a. The 18 amino acid secretion signal peptide is shown in bold (amino 
acid residues 1 to 18). 

FIGURE 8 shows the DNA and amino acid sequence of the recombinant 
mMUP reporter molecule. The protein contains a sixteen amino acid N- 
temiinal addition, comprising of 6 amino acids from the pGEX vector (italics — 
amino acid residues 1 to 6) and the c-myc epitope (shown in bold ~ amino acid 
residues 7 to 16). 

FIGURE 9 shows the DNA and amino acid sequence of the recombinant 
BLGm reporter molecule. The protein contains a six amino acid N-temiinal 
addition from the pGEX vector (italics - amino acid residues 1 to 6) and the C- 
teraiinal c-myc epitope (bold - amino acid residues 170 to 179). 
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FIGURE 10 shows (a) West«n blot of GST-BLGm fusion protein. Lanes 1 to 
6 show fractions eluted from a glutathione-agarose column. Lane C, mMUP 
protdn control, (b) Western blot of GST-MUPm fusion protein. Lanes 1 to 7 
show fractions eluted from glutathione-agarose column. Blots were probed 
using 9E10 anti-myc antibody directly conjugated to HDRP (Roche). 

FIGURE 11 shows Western blot analysis of urine samples (ISjxl) collected 
from mice, following injection with either (A) vehicle or recombinant mMUP 
(2.5mg/kg); or (B) recombinant mMUP (5 and lOmg/kg). Blots were probed 
with anti-myc antibody. Uninjected recombinant GSTmMUP (~ 45kDa, open 
arrow) was included as a positive control (right hand lane). The closed arrow 
indicates the position of the -ISKDa mMUP control band. 

FIGURE 12 shows Western blot analysis of urine samples taken at various 
time points (in hours) and plasma (?) at 24 hours from mice that had been 
injected with recombinant GST-BLGm and GST-mMUP. Blots were probed 
with an anti-GST antibody. Arrow indicates the expected size of the band 
corresponding to GST-mMUP protein. 

FIGURE 13 shows the 3-dimensional solution structure of IvIUP. The 
antiparallel P-sheets are shown in brown, and the loop regions in blue. The EF 
loop is maiked, as is the FG loop. Red lines indicate amino acid positions 
where the internal restriction site additions wwe made. 

FIGURE 14 shows antibody detection of epitope tagged MUP reports 
proteins: (A) Haemaglutinin (EIA) tagged MUP protein was expressed in E. 
colU and extracts from induced (Lane 1) and uninduced (Lane 2) cells analysed 
by western blotting using an anti-HA antibody (3F10, Roche) HRP-conjugated 
second antibody and ECL detection (Amersham). Lane 3 contains molecular 
size markers. A specific.band of the_expected size is seen for the HA-tagged 
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GST-MUP fusion protein; (B) ERB tagged MUP protein was expressed in E. 
cell and extracts from induced (Lane 2) and uninduced (Lane 3) cells analysed 
by western blotting using an anti-ERB antibody (ICRF Technology), HRP- 
conjugated second antibody and ECL detection (Amersham). (Lane 1 
molecular size markers). A specific band of the expected size is seen for the 
ERB-tagged GST-MUP fusion protein. Extensive photo-bleaching is seen in 
Lane 1, due to the amount of protein present. 

FIGURE 15 shows modified MUP proteins produced from the pSecTag vector. 
The various modifications made to the wild-type MUP protein sequence 
(overlined region) are shown: the IgK signal peptide leader, which is cleaved 
during processing (++++);; the c-myc epitope tag (underlined); the iTag 
insertion sequence in the FG loop (italics); and the Clone 100 epitope tag 
(bold), and the other C- and N-terminal modifications and additions. 

FIGURE 16 shows results of pSecTag MUP constructs that were transfected 
into A2780 cells using Fugene, and the medium (SO^l) directly examined for 
secreted protein by Western blotting, using anti-myc antibody 9E10. Lane C, 
recombinant mMUP control; Lane 1, pSMLiclOO; Lane 2, pSML; Lane 3, 
pSM; Lane 4, pSecmMUP. Several protein bands are present in the 
pSecmMUP medium, due to the presence of multiple start sites in the 5'-region 
of this construct. 

FIGURE 17 shows analysis of mouse urine containing either GST or GST- 
mMUP, together with GST or GST-mMUP in phosphate buffered saline (PBS) 
for GST enzymic activity. The concentration of all proteins was lOOiig/ml. 
The graph shows GST enzymic activity, as absorbance (340nm) versus time, 
relative to the absorbance at the 30 second timepoint. 

FIGURE 18 shows the nucleotide sequence for ovine betalactoglobulin (BLG) 
(accession no. X12817), available from www.ncbi,nlm,nih , pov/entrz. 
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published by Harris.S et al Nucleic Acids Res. 16 (21), 10379-10380 (1988); 
Watson,CJ. et al Nucleic Acids Res. 19 (23), 6603-6610 (1991). The signal 
peptide is coded for by residues 842 to 895 and mature protein fiom 6 exons at 
residues 896..937,1602..1741,2586..2659,3772..3882.4551..4655,4869..4882 

HGURE 19 shows the amino acid sequence for ovine betalactoglobulin (BLG) 
coded for by the nucleotide sequence of Figure 16. 

HGURE 20 shows the cDNA encoding the mRNA of murine major urinary 
protein 1 (Mupl), (Accession no. MM 031188), ), available from 
.www.ncbi.nlm.mh.gov/entr7, published Lucke et cd Eur. J. BiochemJ266 (3), 
1210-1218 (1999); Abbate, et al J. Biomol. NMR 15 (2), 187-188 (1999); 
Ferrari et al FEBS Lett. 401 (1), 73-77 (1997); Held, et al Mol Cell. Biol. 7 
(10), 3705-3712 (1987); Bennett et al J. Cell Biol. 105 (3), 1073-1085 (1987); 
Shahan et al Mol Cell. Biol. 7 (5), 1938-1946 (1987); Qark et al EMBO J. 4 
(12), 3167-3171 (1985); Clailc, et al EMBO J. 4 (12), 3159-3165 (1985); 
Ghazal et al Proc. Nat'L Acad. Sci USA. 82 (12). 4182-4185 (1985); Kuhn et 
al Nucleic Acids Res. 12 (15), 6073-6090 (1984); Qark et al EMBO J. 3 (5), 
1045-1052 (1984); Krauter et al J. Cell Biol. 94 (2). 414-417 (1982); coding 
sequence from residues 1 12..654. 

FIGURE 21 shows the amino acid sequence for murine major urinary protein 
coded for by the nucleotide sequence of Figure 18. 

HGURE 22 shows the cDNA sequence encoding the mRNA of rat alpha-2-u 
globulin (accession no. M27434) ), available from 
www.ncbi .nlm.nih.gov/entrz. published by Roy et al 
J. Steroid Biochem.:n (4-6), 1129-1134 (1987) 
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FIGURE 23 shows the GST coding sequence derived from pGEX6p-l. The 
GST coding sequence is nucleotide residues 241-917. The residues highlighted 
in bold 

5 Leu Glu Val Leu Phe Gin Gly Pro 

ctg gaa gtt ctg ttc cag ggg ccc 

represent the PreScission™ Protese cleavage recognition sequence position 
918-938. The protease cleavage site allows for the production of cleaved myc- 
10 tagged proteins from the GST fusion proteins as described in Example 6. 

Example 1; Preparation of poclATBLG 

The alAT promoter (350bp) was excised from al AT/CAT (YuU et al Transgenic Res. 
4 70-74 (1995)) as a HindUI Smal fragment and inserted into pBluealAT, Digestion 
15 of this with EcoRV and Xhol allowed direct insertion of the alAT promoter into 
pXen6.S (Simon Temperley, CXR Biosciences) digested with the same enzymes. The 
microinjection fragment was purified after digestion of the plasmid with palATBLG 
(shown in Figure 2). 

20 Example 2; Preparation of pX4T3^MvcMUP 

A Xhol/Kpnl fragment encoding amino terminal c-Myc tagged mouse MUP was 

inserted into pXAM4 (CXR Biosciences) effectively placing it under the control of the 
CMV promoter. pXAM4 was previously constructed by inserting a PGR generated 
fragment containing the CMV promoter as a BamHl-XhoI fragment into a pSP72 
25 (Promega) multiple cloning site which had been modified by addition of a linker 
which added restriction sites allowing insertion of additional fragments downstream of 
the CMV promoter sequence. 

Example 3; Preparation of pXCS^MvcMUP 
30 A 2.5kb DNA fragment encompassing the murine CyplAl promoter arid upstream 
sequences was inserted into SstH/XhoI digested pX4T.3'MycMUP (Thomas 
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McCartney, CXR Biosciences) to -engineer a reporter vector capable of expressing 
COOH terminally c-Myc tagged MUP upon induction of the CYPl Al promoter using 
a suitable inducing agent, if the construct is used to transfect a suitable cell line or to 
generate a transgenic animal. 

Example 4; pcDNA^'MvcMUP 

A DNA fragment encompassing the COOH terminally c-Myc tagged MUP was 
excised from pX4T.3'Myc (Thomas McCartney, CXR Biosciences) to engineer an 
expression vector capable of constitutive expression of c-Myc tagged MUP if used to 
transfect a suitable cell line or to generate a transgenic animal. 

Example 5; Expression of Mvc-MTTP 

Constructs were tested by transient transfection of a 90% confluent monolayer of 
Hepal-6 ceDs in a T-25 flask using 6ug of DNA in accordance with the protocol 
supplied with lipofectamine transfection reagent (Invitrogen). 

Cells and 5ml of medium were harvested 48 hours post-transfection. Total protein 
from the cell pellets was obtained using 1ml TRI reagent (Sigma) per pellet in 
accordance with directions. Cellular protein was further purified using the PlusOne 
SDS-PAGE Clean-Up Kit (Amersham) in accordance with directions. 
Coixespondingly, protein was purified from lOOul samples of growth medium fiom 
each transfected cell batch using the PlusOne SDS-PAGE Clean-Up Kit in accordance 
with directi(»is. 

Cell extracts and culture medium from Hepal cells transfected with constructs 
designed to constitutively express NH3 and COOH terminally Myc tagged MUP 
coding sequences firom the CMV promoter (2°*^ and 3"" lanes from left respectively in 
both left and right panels; plasmids X4T5'MycMUP and X4T3'MycMUP 
respectively) were subject to SDS-PAGE. Results shown in Figure 6 
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Western blot analysis by probing with antibody against c-Myc showed the presence of 
COOH terminally tagged MUP in both cell extract and medium of Hepal cells (3"^ 
lane from left in both left and right hand panels). Results shown in Figure 6 

5 25% of the total cellular protein samples and the entire protein sample derived from 
the growth medium were analysed by SDS-PAGE followed by western blot in 
accordance with equipment manufacturer's (BIO-RAD) directions* The blot was 
probed using the murine monoclonal Anti-Myc antibody 9E10 (Sigma) in conjunction 
with anti-mouse Ig BRP conjugated antibody (Amersham). Visualisation was 
10 performed using ECL reagent (Amersham) in accordance with directions. 

Example 6; Production of recombinant epitope tagged lipocalin proteins 
Two candidate lipocalin family members, ovine beta-lactoglobulin (BLG) and mouse 
major urinary protein (MUP) have been shown to function as excreted reporter 
15 molecules. This has been achieved by introducing recombinant protein to mice via 
intravenous injection into the tail vein, followed by analysis of urine and plasma by 
western blotting. 

To expand the application of a secreted/excreted reporter, it is possible to modify the 
20 reporter protein by the addition of specific epitope tag. This should allow a single 
reporter protein backbone to report on a number of specific events within a single 
system. We have demonstrated the ability to introduce additional amino acid motifs 
containing epitope tags at the N-teraiinus, the C-terminus and at several internal loop 
positions of the lipocalin reporter protein, 

25 

Recombinant MUP and BLG were expressed in E.coli using the pGEX vector system 
(Amersham Bioscience), which expresses all inserted sequences as a C-terminal fusion 
protein with vector encoded glutathione-S-transferase (GST). GST may be removed 
from the insCTted fusion partner via a specific proteolytic cleavage site located at the C 
30 terminal end of GST. 
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A MUP clone, MmupPa, was derived from mouse liver KNA by RT-PCR, and the 
identity confinned by sequencing (Figure 7). This clone. Mmup9a, is almost identical 
(536/537 bases) to the MusMupl type I MUP clone (M16355, Genbank). The MUP 
coding sequence, minus the N-tenninal 18 amino acid signal peptide, was rederived 
from clone Mup9a, by PGR as an Ncol-Xhol fragment, and cloned into the E. coU 
expression vector pGEX-6PB (derived fiom pGEX-6P-l, Amersham Bioscience) to 
produce pGEX-MUP. A synthetic linker oligonucleotide was then used to add the c- 
myc epitope sequence, as an Ncol-Ncol fragment, to the 5'-end of the MUP coding 
sequence to give pGKX-mMUP. 

pCDS'mycBLG. containing the BLG precursor protein cDNA fused with a C-tenninal 
myc epitope tag, was constructed from the BLG cDNA clone pBlacD (Roslin 
Instimte). The C-teiminal myc-tagged BLG coding sequence, minus the 18 amino acid 
signal peptide, was derived by PGR from pCD3'mycBLG (containing the BLG 
precursor protein cDNA ftised with a C-terminal myc epitope tag) and cloned direcdy 
into pGEX-6PB, to produce pGEX-BLGm. 

Constructs pGEX-mMUP and pGEX-BLGm were then used to produce recombinant 
GST fusion proteins in E. coli DH5a, and the GST fragments removed by protease 
treatment (PreScission Protease, Amersham Bioscience) to generate N-terminally 
myc-tagged MUP (mMUP - Figure 8) and C-teiminally myc-tagged BLG (BLGm - 
Figure 9) lipocalin reporter proteins respectively. Purification of recombinant protein 
was achieved via affinity chromatography foUowing the manufacturers recommended 
protocols (Amersham Bioscience). 

Both the GST fusion precursors and the cleaved myc-tagged protein products were 
recognised on western blots (Figure 10) using horseradish peroxidase (HRP) diiecfly 
conjugated to an anti-myc antibody (9E10, Roche) and ECL chemfluminescent 
detection kit (Amersham Bioscience). 
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Example 7: In vivo excretion of MUP and BLG epitope tagged iipocalin reporter 
proteins 

In order to demonstrate the excretion of epitope-tagged MUP and BLG reporter 
5 proteins, recombinant epitope-tagged mMUP Iipocalin protein was injected i.v. into 
male CDl mice (3 doses, 2.5mg/kg, 5mg/kg and lOmg/kg with 3 mice per group, via 
the tail vein). A control group were also injected with the vehicle solution (isotonic 
sterile saline). After injection, urine samples were collected from mice, by scruffing, at 
approximately 30 minute time intervals over a 6h period. Mice were sacrificed after 24 
10 hours and urine and serum samples taken. 

Urine was analysed by SDS PAGE, followed by western transfer to nitrocellulose 
membrane (Hybond ECL, Amersham Bioscience) and probed with HRP-conjugated 
anti-myc antibody (9E10, Roche) and detected with the ECL detection kit (Amersham 
15 Bioscience). 

The results of this analysis are shown in Figure 11. From this, it can be seen that the 
majority of MUP protein was detected in the first two or three samples i.e. within 2h 
post injection. Urine samples collected at later time points and serum taken from 
20 animals after 24h did not contain detectable MUP reporter protein. These data clearly 
demonstrate that exogenous mMUP in the bloodstream of mice is eliminated rapidly 
and efficiently in the urine. 

Western blot analysis was repeated on all samples after three weeks to determine the 
25 stability of recombinant protein in mouse urine upon storage at ~20'*C. The results 
were similar to those initially obtained (data not shown), showing no appreciable 
decrease in sensitivity, demonstrating that mMUP protein is able to withstand long 
term freezer storage and thawing. 

30 In order to demonstrate the application of Iipocalin reporter proteins containing a large 
epitope tag (GST), tail vein injections were conducted subsequently with recombinant 
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myc-tagged lipocalin-GST fusion proteins (GST-BLGm and GST-mMUP). Each 
protein was injected at a dose of 5mg/kg. Samples were ftactionated by SDS PAGE 
and analysed by western blotting. Blots were probed using an anti-GST antibody 
(Sigma), HRP-conjugated anti-rabbit secondary antibody (Jackson ImmunoReseaich) 
and ECL detection kit (Amersham Bioscience). Urine samples collected early and late 
after IV injection and plasma from a terminal bleed were included in the analysis. 
From Figure 12, it can be seen that GST-BLGm and GST-mMUP proteins are detected 
in urine samples throughout the sampling period and also in plasma taken from the 
animal after 24 hours. 

The difference in excretion profiles between GST-mMUP fusion protein (45kDa mol. 
weight) and mMUP (~18kDa mol. weight) could reflect a difference in the 
physiological processing of the former (e.g. rcabsoiption via the kidney into the 
plasma) or less efficient excretion. A choice of non-invasive reporter molecule whose 
excretion characteristics differ in such a manner could prove useful, depending on 
whether a persistent readout or a more rapidly decaying, and thus responsive, signal 
are required. 

Example 8; Epitope tagging of lipocalin reporter protein 

MUP and BLG lipocalin reporter proteins have been successfully tagged with N- and 
C-teiminal tags (above data for GST and c-myc tags). Intemal loop positions within 
the MUP protein have also been used to introduce the peptide epitope sequences. 
Several potential positions for the introduction of epitope tags were chosen, from the 
MUP protein structure (Figure 15), as being in external loops. The initial position 
chosen to introduce a tag corresponded to a site within the EF loop of BLG protein 
that had previously been used to introduce a kinase recognition site. This had utilised a 
Clal restriction site in the BLG gene, however there is no conespondmg restriction 
site in the MUP gene. Consequentiy, the Mup cDNA sequence was modified by the 
introduction of a) an Avrll-Apal-Sbfl linker fragment into the sequence coding for EF 
loop region and b) a Spel-EcoRl-Nsil Unker firagment at the 3'end of the coding 
sequence. The particular restriction site combmations were chosen since they would 
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generate compatible oveihanging ends, for the insertion of adapter oligonucleotides 
containing epitope sequences. The MUP 5'-coding region from position 10 to 300, 
together with an additional GATGCGGTACCACCATGGTGTCTAGACTGCAG 5'- 
sequence (containing a Kozak signal, start codon and NcoI-KpnI-Xbal-PstI linker) and 
5 an additional CCTAGGC sequence (containing an Avrll restriction site) was generated 
by PGR. The corresponding MUP 3*-region from position 301 to 540, together with an 
additional TGCCTAGGGCCCTGCAGGGTA 5'-sequence (containing an Avrll- 
Apal-Sbfl linker) and ACTAGTGAATTCATGCATTGAGCTAGCCATC 3'sequence 
(containing an Spel-EcoRI-Nsil-Nhel linker and stop codon was generated by PGR. 
10 ligation of these two fragments, at the common Avail site generated the required 
modified MUP coding sequence, on a Ncol-Nhel fragment. 

Restriction digest with either Avrll/Sbfl (internal HF loop) or Spel/Nsil (C-tCTminus) 
results in an identical pattern of overhanging ends, to which double stranded 
15 oligonucleotide linkers, of the general form: 
CTAG N {NNN)x N TGCA 

N {mm)x 

where x is a multiple of 3, that contain an epitope tag, can anneal. 

20 MUP lipocalin reporter proteins have also been produced, in which the epitope has 
been introduced into the FG loop position. This has been accomplished by the 
insertion of a Hindni-BamHI-EcoRI linker fragment into the MUP coding sequence at 
the FG loop position. This has allowed the insertion of adapter oligonucleotides 
containing epitope sequences into the Hindlll/EcoRI sites. The IVIUP coding sequence, 

25 from position 1 to 348, together with an additional GGTACCACC 5'-sequence 
(containing a Kpnl restriction site and Kozak sequence) and an additional 
AAGCTTGGAACCGGATCC 3*-sequence (containing Hindlll-BamHI sites) was 
generated by PGR, as was the corresponding MUP coding sequence from position 349 
to 540, together with an additional GGATCCTCTTCAGAATTC 5'-sequence 

30 (containing BamHI and EcoRI restriction sites) and an additional 
GAGGAGAAAGTCATGTGTGAAGAGGATCTGTGAGCTAGC 3'-sequence 
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(containing the c-myc GluGlnLysLeuHeSerGluGluAspLeu epitope tag . stop codon 
and Nhel restriction site). Ligation of the two fragments, at the BanaHI site generated 
the modified MUP coding sequence, on a Ncol-Nhel fragment. 

Restriction digest with HBndlliyEcoRI results in overhanging ends, to which double 
stranded oligonucleotide linkers, of the general form: 
AGCT T (NNN)x G 

A (HISIN)x C TTAA 
where x is a multiple of 3, that contain an epitope tag, can anneal. 

Epitopes that have been inserted into the FG loop, by this method, include: 



Ebemaglutuiin 


(YPYDVPDYA) 


ClofoelOO 


(NVKFSnVRRRA) 


rablla 


(KQMSDRRENDMSPS) 


DOB 


(SGNEVSRAVLLPQSQ 


SGll 


(SSLSYTNPAVAATSANL) 


eibB4 


(RSHjQHPDYLQEYST) 


ARF 


(VSTLLRWERFPGHRQA) 


RYK 


(KFQQLVQCLTEEHAALGAYV) 


WILPEPl 


(QEQCQEVWRKRVISAFLKSP) 


HAFIO 


(RLSDKTGPVAQEKS) 



MUP coding sequences, containing these epitope tag sequences, were expressed in E. 
coli as GST fusion precursor proteins, and cleaved tagged MUP proteins, using the 
pGEX expression system (Amorsham Biosciences). 

FG loop modified MUP coding sequraice was cloned into NcoI-NotI cut pGEX6P 
vector to generate pGSLM. that contains the MUP coding region downstream of the 
GST coding sequence and Precissionase cleavage site. 
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Indi^ddual epitope tags were introduced by HindHWEcoRI digestion and annealing of 
epitope containing oligonucleotide linkers. 

E. coli strain TOPIO (Invitrogen) was transformed with the pGSLM-tag construct, 
5 using the manufacturers standard protocols. 

The resultant transfoimed bacterial strains were grown iii shaking flask culture to an 
ODeoo of 0.5-0.6. Once the optimal turbidity was attained a small sample was removed 
as a control and IPTG added to the remaining culture to a final concentration of 
10 0.5mM. Both the control sample (uninduced) and the induced cultures were grown for 
a further 2-3 hours. After the final growth step 0.25ml and 0.5ml of uninduced and 
induced culture respectively was spun down and resuspended in lOOul 6xGLB and 5- 
lOul of each run on NuPAGE gels (Invitrogen) to ascertain whether induction had 
taken place and the fusion product was the correct size. 

15 

The remaining induced culture (3.2L total for large preps) was spun down, lysed and 
cell debris removed by centrifugation. GST fusion proteins from cleared lysate were 
allowed to bind to Glutathione-Agarose beads (SIGMA) for 0.5-1 hour at +4**C. The 
protein/bead slurry was poured onto a gravity flow colmnn and the resultant gel bed 

20 washed thoroughly with lysis buffer to remove bacterial proteins. Fusion proteins were 
then eluted from the gel bed with excess Glutathione (lOmM in 50mM Tris pH8.0). 
Samples were checked via SDS-PAGE and Immunodetection before proceeding to 
cleave and purify the tagged MUP protein from the GST fusion. The purified eluate 
was dialysed in cleavage buffer (4x3 hours) and then incubated for 16 hours with at 

25 least 60 units of Precissionase at 44*^C. The digested protein was then added to a 
gravity flow colunm containing fresh Glutathione-Agarose beads which bound the 
GST and Precissionase allowing the elution of the cleaned, digested tagged MUP 
protein. The eluate was re-added twice to rasure complete removal of contaminating 
proteins and then concentrated using Centricon-P20 columns (MiUipore) to give the 

30 final protein solution. 
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10 



Extracts from induced and uninduced cells were analysed by western blotting for the 
presence of the relevant tagged MUP protein, using an epitope-specific monoclonal 
antibody. Some representative results are shown in Figure 14. 

Example 9; In vivo expressd on and secrctton of linnpaKn reporter Drotefn>; 
It is possible that modifying the protein sequence, by the introduction of epitopes, 
would affect protein folding or secretion. In order to examine this, we have expressed 
the modified MUP proteins in murine Hepal-6 hepatoma cells and in human A2780 
ovarian carcinoma cells. 

MUP lipocalin reporter sequences, containing internal modifications at protein loop 
positions, were cloned into the pSecTag2 vector (Invitrogen). This vector contains a 
murine Ig Kappa signal peptide, a 3'-c-myc and His tag, and is designed to express 
tagged secreted proteins in mammalian cells. 

In this way, 4 MUP reporter constructs, coding for proteins that contain epitope tag. 
modifications at either the N-teiminus. the C-terminus or at the internal FG loop 
position, were created (Figure 15). 

20 The DNA constructs were transfected into both murine Hepal-6 hepatoma cells and 
human A2780 ovarian carcinoma cells, using Fugene transfection reagent (Invitrogen). 
After 72h, medium was collected and analysed for the presence of secreted protein by 
western blotting. A typical blot is shown in Hgure 16. 



15 



25 



30 



Tbe results demonstrate that MUP Upocalin reporter proteins, containing multiple 
modifications, are properly folded and secreted from mammalian ceUs. 

Example lOi Enzvmir detection of lipocalin protein 

To demonstrate the detection of a lipocalin reporter by means of an epitope tag that 
contains enzymic activity, we have examined the GST enzymic activity of the GST- 
tagged MUP lipocalin reporter protein. 
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Mouse urine, that had previously been spiked with GST-niMUP protein (lOO^ig/ml) 
was analysed for GST enzymic activity using a colorimetric assay (GST-Tag Kit, 
Novagen). The assay was perfomed according to the manufacturers recommended 
protocol, using a Hitachi-USOlO spectrophotometer and Ifitachi UV Solutions Version 
5 1.2 software- Absorbance was measured at 340nm. Readings were taken every 30 
seconds for 300 seconds 

The results show that GST-mMUP lipocalin reporter protein can be efficiently 
detected in mouse urine by means of GST enzymic activity (Rgure 17), The activity 
10 of the GST-niMUP protein, in both urine and PBS, is similar to that of GST protein 
itself. 

Example 11; Expression of epitope tagged lipocalin reporter proteins in 
transgenic animals 

IS Transgenic animals are generated using one of several standard methods including 
pronuclear injection (Gordon and Ruddle, Science 214, 1244-1246 (1981)), blastocyst 
injection of transfected cells (Smithies et ah. Nature 317, 230-234 (1985)) or using 
viral vectors (Lois et al. Science 295, 868-872 (2002); Pfeifer et al., Proc. Natl Acad. 
Set USA 99, 2140-2145 (2002)). The transgene comprises DNA firagments including a 

20 promoter sequence driving an open reading frame encoding a tagged-lipocalin. 

For example transgenes contain the mouse Cyplal promoter sequence driving 
expression of myc epitope tagged MUP or BLG reporters, as follows: 

25 pXCS'mycMUP. A 2.4Kb fragment encompassing the murine Cyplal promoter was 
derived by PGR from murine genomic DNA. This was cloned into the vector pXenSs 
(CXR Biosciences) as a SpeVXhol fragment to yield the vector pXen5Cyp. The Cypla 
promoter was subsequently moved from pXen5Cyp into the vector pXen4.3'mycMUP 
(CXR Biosciences) as an SstWXIiol fragment replacing the CMV promoter contained 

30 in this vector. The resultant vector pXC3'mycMUP contains a C-tenninally tagged 
MUP reporter running under the control of the murine Cyplal promoter. 
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pXCS'mycBLG. The BLG reporter was amplified from the vector pBLacD (Roslin 
Institute) by PGR. adding flanking Xhol and Kpril sites and inserting a C-teiminal Myc 
epitope tag. This fragment was digested XhoVKpnl and used to replace the MUP 
reporter in XhoVKpnl digested pXCS'mycMUP vector. The resultant vector 
pXCS'mycBLG contains a C-terminally tagged BLG reporter running under the 
control of Ae murine Cyplal promoter. 

Positive transgenic animals are identified by analysis of DNA (Whitelaw et al.. 
Transgenic Res. 1. 3-13 (1991)) and bred to generate transgenic lines. Transgenic 
animals are exposed to stress, for example by drug administration, and blood and urine 
collected over time. Samples collected pre- and post-insult are analysed for the 
presence of the tagged-lipocaUn by standard methods, including Western blot and 
ELLS A. Depending on the specific insult or inducing agent an increase or decrease in 
reporter activity are detected. 



Transgenes may also be refined to allow expression in specific cells, for example 
through the DNA recombination based strategies (Hering et al., Proc 
NatlJicad.Sci.USA 90, 8469-8473 (1993)1 Gu etal., CeU 73, 1155-1164 (1993)). 

Alternatively DNA promoter-reporter constracts are introduced into somatic cells of 
an animal. This could be achieved through the use of adenovirus (Lai et al., DNA Cell 
Biol. 21. 895-913 (2002), other viral vector methods (Logan et al., Curr. Opin. 
Bioetcnol. 13, 429-436 (2002)) or by non-viral methods including the direct 
introduction of naked DNA (Niidome and Huang, Gene Ther. 9, 1647-1652 (2002). 
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CLAIMS 

1. A nucleic acid construct comprising (i) a nucleic acid sequence encoding a 
member of the lipocalin protein fanMly, and (ii) a nucleic acid sequence encoding a 
peptide sequence of from 5 to 250 amino acid residues 

5 

2. A nucleic acid construct as claimed in claim 1, in which the lipocalin is 
selected from the group consisting of: ovine betalactoglobulin (BLG) (accession No. 
X12817), murine major urinary protein (MUP) (accession No. NM 031188) and rat a- 
2-urinary globulin (a-2u) (accession number M27434). 

10 

3. A nucleic acid construct as claimed in claim 1 or claim 2, in which peptide 
sequence is an epitope. 

4. A nucleic acid construct as claimed in claim 3, in which the epitope is selected 
1 5 from the group consisting of EQKLISEEDL, GZPIPNPLLGLDST. YPYDVPD YA, 

m^STTVRRRA, KQMSDRRENDMSPS, SGNEVSRAVLLPQSC, 
SSLSYTNPAVAATSANL, RSTLQHPDYLQEYST, VSTLLRWERFPGHRQA, 
KFQQLVQCLTEFHAALGAYV, QEQCQEVWRKRVISAFUKSP, and 
RLSDKTGPVAQEKS 

20 

5. A nucleic acid construct as claimed in any one of claims 1 to 4, in which the 
construct additionally comprises a promoter element upstream of the (i) a nucleic acid 
sequence encoding a member of the lipocalin protein family, and (ii) and nucleic acid 
sequence encoding a peptide sequence of from 5 to 250 amino acid residues. 

25 

6. A nucleic acid construct as claimed in claim 5, in which the promoter element 
may be selected from one of the following groups consisting of : 



30 



(i) c-myc, p21/WAF-l, MDM2, Gadd45, FasL, GAHSP40, TRAIL-R2/DR5, 
BTG2/PC3; 
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(ii) MnSOD, CuZnSOD, IkB, A'IP4, xanthine oxidase, COX2, iNOS, Ets-2, 
FasL/CD95L, -yGCS, ORP150. 

(iii) Lrg-21, SOCS-2. SOCS-3. PAI-1, GBP28/adiponectin. a-1 add 
glycoprotein, metaUothioneine I, metallothioneine H, ATF3, IGFbp-3, VDGF 
and HIFla. 

(iv) Gadd 34, GAHSP40, TRAIL-R2/DR5, c-fos. CHOP/Gaddl53, APAF-1, 
Gadd45, BTG2/PC3, Peg3/Pwl, Siahla, S29 ribosomal protein, FasL/CD95L, 
tissue tranglutaniinase, GRP78, Nin77/NGH-B, CyclophilinD, p73 andBak. 

(v) a promoter ftom a xenobiotic metaboUsing cytochrome p450 enzymes ftom 
the 2A, 2B, 2C, 2D, 2E, 2S, 3A, 4A and 4B gene families. 

(vi) a synthetic promoter sequence comprised of a minimal eukaryote 
consensus promoter operatively linked to one or more response elements 
selected from the group consisting of the aryl hydrocarbon (Ah)/Ah nuclear 
translocator (ARNT) receptor response element, the antioxidant response 
element (ARE), the xenobiotic response element (XRE). 

7. A nucleic acid construct comprising a stress inducible promoter operatively 
isolated from a nucleic acid sequence encoding a member of the lipocalin protein 
family by a nucleotide sequence flanked by nucleic add sequences recognised by a • 
site spedfic recombinase, or by insertion such that it is inverted with respect to the 
transcription unit encoding a member of the lipocalin protein family, in which the 
construct additionaDy comprises a nucleic add sequence comprising a tissue spedfic 
promoter operativdy linked to a gene encoding the coding sequence for the site 
spedfic recombinase. 

8. A nuddc add construct as daimed in daim 7, in which the site spedfic 
recombinase sequences are two /oaP sites of. bactoiophage PI. 
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9. A host cell transfected with a nucleic acid construct according to any one of 
claimis 1 to 8. 

5 id. A transgenic non-human animal in which the cells of the non-human animal 
express the protein encoded by the nucleic acid construct according to any one of 
claims 1 to 8. 

11. A transgenic non-human animal as claimed in claim 10, in which the non- 
10 human animal is a mammal 

12. A transgenic non-human mammal as claimed in claim 11, in which the 
manmial is a mouse 

15 13. The use of a nucleic acid construct according to any one of claims 1 to 8 for 
the detection of a gene activation event resulting from a change in altered metabolic 
status in a cell in vitro or in vivo. 

14. A use as claimed in claim 13, in which the gene activation event is the 
20 induction of toxicological stress, metabolic changes, or disease, including a disease 

state that is the result of viral, bacterial, fungal or parasitic infection. 

15. The use of a nucleic acid construct comprising a nucleic acid sequence 
encoding a member of the hpocalin protein family, wherein said lipocalin protein is 

25 heterologous to the cell in which it is expressed, for the detection of a gene activation 
event resulting from a change in altered metabolic status in a cell in vitro or in vivo. 

16. A use as claimed in claim 15, in which the gene activation event is induction of 
toxicological stress, metabolic changes, or disease, including a disease that is die result 

30 of viral, bactmal, fungal or parasitic infection. 
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17. A method of detecting a gene activation event in a cell in vitro or in vivo, 
comprising assaying a host cell stably transfected with a nucleic add construct in 
accordance with any one of claims 1 to 8, or a transgenic non-human animal according 
to any one of claims 10 to 12, in which the cell or animal is subjected to a gene 
activation event that is signalled by expression of a peptide tagged lipocalin rqxMter 
gene. 

18. A method of detecting a gene activation event in a cell m vitro or in vivo, 
con5)rising assaying a host ceU stably transfected with a nucleic acid construct 
comprising a nucldc acid sequence encoding a member of the lipocalin protein family, 
wherein said lipocalin protein is heterologous to the ceU in which it is expressed, or a 
transgenic non-human animal whose cells express such a construct, in which the cell 
or animal is subjected to a gene activation event that is signalled by expression of a 
peptide tagged lipocalin reporter gene. 

19. A method of screening for, or monitoring of toxicologically induced stress in a 
cell or a cell line or a non-human animal, comprising the use of a cell, cell line or non 
human animal which has been transfected with or carries a nucleic acid construct 
according to any one of claims 1 to 8. 

20. A method for screwing and characterising viral, bacterial, fungal, and parasitic 
infection comprising the use of a cell, cell line or non human animal which has been 
transfected with or carries a nucleic add constract according to any one of claims 1 to 
8. 

21. A method for screening for cancer, inflammatory disease, cardiovascular 
disease, metabolic disease, neurological disease and disease with a genetic basis 
comprising the use of a cell, cell line or non human animal which has been transfected 
with or carries a nucleic acid construct according to any cme of claims 1 to 8. 
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Met:LysMei:LeuI«exiIteuIseuCysI«eu61yIteuThrI«euValCysValH±sAlaGluGlu 
1 ATGAAGAT6CTGCTGCTGCT6TGTTTGGGACTGACCCTAGTCT6TGTCCATGCAGAAGAA 

AlaSerSerThrGlyArgAsnPheAsnValGluLysIleAsnGlyGluTrpHisThrlle 
61 GCTAGTTCTACGGGAAGGAACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATT 

IleLeuAlaSerAspLysArgGluLysIleGluAspAsnGlyAsnPheArgLeuPheLeu 
121 ATCCTGGCCTCTGACAAAAGAGAAAAGATAGAAGATAATGGCAACTTTAGACTTTTTCTG 

GluGlnlleHisValLeuGluLysSerLeuValLeuLysPheHisThrValArgAspGlu 
181 GAGCAAATCCATGTCTTGGAGAAATCCTTAGTTCTTAAATTCCATACTGTAAGAGATGAA 

GluCysSerGluLeuSerMetValAlaAspLysThrGluLysAlaGlyGluTyrSerVal 
241 GAGTGCTCGGAATTATCTATGGTTGCTGACAAAACAGAAAAGGCTGGTGAATATTCTGTG 

ThrTyrAspGlyPheAsnThrPheThrlleProLysThrAspTyrAspAsnPheLeuMet 
301 ACGTATGATGGATTCAATACATTTACTATACCTAAGACAGACTATGATAACTTTCTTATG 

AlaHisLeuIleAsnGluLysAspGlyGluThrPheGlnLeuMetGlyLeuTyrGlyArg 
361 GCTCATCTCATTAACGAAAAGGATGGGGAAACCTTCCAGCTGATGGGGCTCTATGGCCGA 

GluProAspLeuSerSerAspIleLysGluArgPheAlaGlnLeuCysGliiLysHisGly 
421 GAACCAGATTTGAGTTCAGACATCAAGGAAAGGTTTGCACAACTATGTGAGAAGCATGGA 

IleLeuArgGluAsnllelleAspLeuSerAsnAlaAsnArgCysLeuGlnAlaArgGlu 

481 ATCCTTAGAGAAAATATCATTGACCTATCCAATGCCAATCGCTGCCTCCAGGCCCGAGAA 
*** 

541 TGA 

FIG. 7 



GiyProLeuGlySerMe tGluGlnLysIjeuIleSerGluGltaAspIieuThrMetGliiAla 
1 GGGCCCCrGGGArCCArGGAGCAGAAACTCATCTCTGAAGAGGATGTGACCATGGAAGCT 

SerSerThrGlyArgAsnPheAsnValGluLysIleAsnGlyGluTrpHisThrllelle 
61 AGTTCTACGGGAAGGAACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATC 

LeuAlaSerAspLysArgGluLysIleGluAspAsnGlyAsnPheArgLeuPheLeuGlu 
12 1 CTGGCCTCTGACAAAAGAGAAAAGATAGAAGATAATGGCAACTTTAGACTTTTTCTGGAG 

GlnlleHisValLeuGluLysSerLeuValLeuLysPheHisThrValArgAspGluGlu 
181 CAAATCCATGTCTTGGAGAAATCCTTAGTTCTTAAATTCCATACTGTAAGAGATGAAGAG 

CysSerGluLeuSerMetValAlaAspLysThrGluLysAlaGlyGluTyrSerValThr 
241 TGCTCGGAATTATCTATGGTTGCTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACG 

TyrAspGlyPheAsnThrPheThrlleProLysThrAspTyrAspAsnPheLeuMetAla 
301 TATGATGGATTCAATACATTTACTATACCTAAGACAGACTATGATAACTTTCTTATGGCT 

HisLeuIleAsnGluLysAspGlyGluThrPheGlnLeuMetGlyLeuTyrGlyArgGlu 
361 CATCTCATTAACGAAAAGGATGGGGAAACCTTCCAGCTGATGGGGCTCTATGGCCGAGAA 

ProAspLeuSerSerAspIleLysGlxoArgPheAlaGlnLeuCysGluLysHisGlylle 
421 CCAGATTTGAGTTCAGACATCAAGGAAAGGTTTGCACAACTATGTGAGAAGCATGGAATC 

LeuArgGluAsnllelleAspLeuSerAsnAlaAsnArgCysLeuGlnAlaArgGlu*** 
481 CTTAGAGAAAATATCATTGACCTATCCAATGCCAATCGCTGCCTCCAGGCCCGAGAATGA 

FIG. 8 
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aagagtctct 
tgttgctaca 
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agggcaacct 
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gctggggaag 
agggcttttt 
tgaacataaa 
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gtatctaaaa 
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3661 cgggtggtga ccccggggga gccccgctgg tcgtggaggg tgctgggggc tgactagcaa 

3721 cccctccccc cccgttggaa ctcacttttc tcccgtcttg accgcgtcca gccttgaatg 

3781 agaacaaagt ccttgtgctg gacaccgact acaaaaagta cctgctcttc tgcatggaaa 

3841 acagtgctga gcccgagcaa agcctggcct gccagtgcct gggtgggtgc caaccctggc 

3901 tgcccaggga gaccagctgc gtggtccttg ctgcaacagg gggtgggggg tgggagcttg 

3961 atccccagga ggaggagggg tggggggtcc ctgagtcccg ccaggagaga gtggtcgcat 

4021 accgggagcc agtctgctgt gggcctgtgg gtggctgggg acgggggcca gacacacagg 

4081 ccgggagacg ggtgggctgc agaactgtga ctggtgtgac cgtcgcgatg gggccggtgg 

4141 tcactgaatc taacagcctt tgttaccggg gagtttcaat tatttcccaa aataagaact 

4201 caggtacaaa gccatctttc aactatcaca tcctgaaaac aaatggcagg tgacattttc 

4261 tgtgccgtag cagtcccact gggcattttc agggcccctg tgccaggggg gcgcgggcat 

4321 cggcgagtgg aggctcctgg ctgtgtcagc cggcccaggg ggaggaaggg acccggacag 

4381 ccagaggtgg ggggcaggct ttccccctgt gacctgcaga cccactgcac tgccctggga 

4441 ggaagggagg ggaactaggc caagggggaa gggcaggtgc tctggagggc aagggcagac 

4501 ctgcagacca ccctggggag cagggactga cccccgtccc tgccccatag tcaggacccc 

4561 ggaggtggac aacgaggccc tggagaaatt cgacaaagcc ctcaaggccc tgcccatgca 

4 621 catccggctt gccttcaacc cgacccagct ggagggtgag cacccaggcc ccgcccttcc 

4 681 ccagggcagg agccacccgg ccccgggacg acctcctccc atggtgaccc ccagctcccc 

4741 aggcctccca ggaggaaggg gtggggtgca gcaccccgtg ggggccccct ccccaccccc 

4801 tgccaggcct ctcttcccga ggtgtccagt cccatcctga cccccccatg actctccctc 

4861 ccccacaggg cagtgccacg tctaggtgag cccctgccgg tgcctctggg gtaagctgcc 

4921 tgccctgccc cacgtcctgg gcacacacat ggggtagggg gtcttggtgg ggcctgggac 

4981 cccacatcag gccctggggt cccccctgtg agaatggctg gaagctgggg tccctcctgg 

5041 cgactgcaga gctggctggc cgcgtgccac tcttgtgggt gacctgtgtc ctggcctcac 

5101 acactgacct cctccagctc cttccagcag agctaaggct aagtgagcca gaatggtacc 

5161 taaggggagg ctagcggtcc ttctcccgag gaggggctgt cctggaacca ccagccatgg 

5221 agaggctggc aagggtctgg caggtgcccc aggaatcaca ggggggcccc atgtccattt 

5281 cagggcccgg gagccttgga ctcctctggg gacagacgac gtcaccaccg cccccccccc 

5341 atcaggggga ctagaaggga ccaggactgc agtcaccctt cctgggaccc aggcccctcc 

5401 aggcccctcc tggggctcct gctctgggca gcttctcctt caccaataaa ggcataaacc 

54 61 tgtgctctcc cttctgagtc tttgctggac gacgggcagg gggtggagaa gtggtgggga 

5521 gggagtctgg ctcagaggat gacagcgggg ctgggatcca gggcgtctgc atcacagtct 

5581 tgtgacaact gggggcccac acacatcact gcggctcttt gaaactttca ggaaccaggg 

5641 agggactcgg cagagacatc tgccagttca cttggagtgt tcagtcaaca cccaaactcg 

5701 acaaaggaca gaaagtggaa aatggctgtc tcttagtcta ataaatattg atatgaaact 

5761 caagttgctc atggatcaat atgcctttat gatccagcca gccactactg tcgtatcaac 

5821 tcatgtaccc aaacgcactg atctgtctgg ctaatgatga gagattccca gtagagagct 

5881 ggcaagaggt cacagtgaga actgtctgca cacacagcag agtccaccag tcatcctaag 

5941 gagatcagtc ctggtgttca ttggaggact gatgttgaag ctgaaactcc aatgctttgg 

6001 ccacctgatg tgaagagctg actcatttga aaagaccctg atgctgggaa agattgaggg 

6061 caggaggaga aggggacgac agaggatgag atggttggat ggcatcacca acacaatgga 

6121 catgggtttg ggtggactcc aggagttggt gatggacagg gaggcctggc gtgctacgga 

6181 agcggtttat ggggtcacaa agactgagtg actgaactga gctgaactga atggaaatga 

6241 ggtatacagc aaagtgggga ttttttagat aataagaata tacacataac atagtgtata 

6301 ctcatatttt tatgcatacc tgaatgctca gtcactcagt cgtatctgac tctgtgacct 

63 61 atggaccgta gccttccagg tttcttctgt ccacagaatt ctccaaggca agaatactgg 

6421 agtgggtagc catttcctcc tccaggggat cctcccgacc cagggattga accggcatct 

6481 cctgtattgg caggtggatt ctttaccact gtgccaccag ggaagcccgt gttactctct 

6541 atgtcccact taattaccaa agctgctcca agaaaaagcc cctgtgccct ctgagcttcc 

6601 cggcctgcag agggtggtgg gggtagactg tgacctggga acaccctccc gcttcaggac 

6661 tcccgggcca cgtgacccac agtcctgcag acagccgggt agctctgctc ttcaaggctc 

6721 attatcttta aaaaaaactg aggtctattt tgtgacttcg ctgccgtaac ttctgaacat 

6781 ccagtgcgat ggacaggacc tcctccccag gcctcagggg cttcagggag ccagccttca 

6841 cctatgagtc accagacact cgggggtggc cccgccttca gggtgctcac agtcttccca 

6901 tcgtcctgat caaagagcaa gaccaatgac ttcttaggag caagcagaca cccacaggac 

6961 actgaggttc accagagctg agctgtcctt ttgaacctaa agacacacag ctctcgaagg 

7 021 ttttctcttt aatctggatt taaggcctac ttgcccctca agagggaaga cagtcctgca 

7081 tgtccccagg acagccactc ggtggcatcc gaggccactt agtattatct gaccgcaccc 

7141 tggaattaat cggtccaaac tggacaaaaa ccttggtggg aagtttcatc ccagaggcct 

7201 caaccatcct gctttgacca ccctgcatct ttttttcttt tatgtgtatg catgtatata 

7261 tatatatata tttttttttt tttcattttt tggctgtgct ggctgttcgt tgcagttcgg 

7321 tgcgcaggct tctctctagt ttctctctag tcttctctta tcacagagca gtctctaga 
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As far as claims 13 to 21 are directed to a diagnostic method practised on 
the human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 

2. [ I Claims Noa.: i 

because they relate to parts of the Intemationat AppRcation that do not comply with the prescribed requirements to such 
an e)dent that no meaningful International Search can be carried out, speclficaOy: 



3. rn OalmsNos.: 

— because they are dependent claims and are not drafted In accordance with the second and third sentences or Rule 6.4(a). 



Box n Otiservations where unity of Invention is lacking (Continuation of Item 2 of first shee^ 



This IntemaUonai Searching Authorlly found multiple Inventions in this international application, as foliofvs: 

see additional sheet 

1. I I As ail required additional search fees were Hmeiy p^d by the applicant, this International Search Report covers aH 
I — " searchable claims. 

2. I I As eil searchable claims could be searched without effbrt Justifying an additional fee, this Authority did not invite payment 

of any addftlonai fee. 

3. I y I As only some of the required additional search lees were timely paid by the applicant, INs IntemaSonal Search Report 

covers only those claims fbr which fees were paid, specifically claims Nosj 

1-6 9-21 



4. rn No required additional search fees were timely paid by the applicant Consequently, this Intemadcvial Search Report is 
restricted to the invention first mentioned hi ttie claims; it is covered by claims Nos.: ' 



Remaric on Protest [ | The addltional search fees were accompanied by the applicant's protest 

I X I No protest accompanied the payment of additional search fees. 



l=orm PCT/ISA/210 (continuation of first sheet (1 )) (July 1998) 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



International AppHcation No. PCT/GB 03/03192 



IIliLl"*®''"?*^???^ Searching Authority found multiple (groups of) 
inventions m this international application, as follows: ^ 

1. claims: 1-6, 9-21 (all partially) 

A nucleic acid construct comprising (i) a nucleic acid 
an3"?y?f ^"^°d?"9 3 "'?!!*er of the lipocalin protein family, 
of fiii^MJ^oin'*" sequence encoding a peptide sequence 
of from 5 to 250 amino acid residues; said nucleic acid 
construct when the lipocalin is ovin4 betalactog obuf n 
(BLG) (accession No X12817); a host cell with said nucleic 
a transgenic non-human animal in which the 
°\*V^ non human ammal express the protein encoded bv 
said nucleic acid construct; the use of said nucleic acid 
construct for detection of a gene activation event ^etSlti no 
from a change in an altered metabolic status in a cell in ^ 
vitro or in vivo; a method for the detection of a gene 
activation event in a cell in vitro or in vivo, comorisina 
lliH^ln^ ; stably transfected with said nSeii ^ 

acid construct, wherein said ovine betalactoglobulin is 
heterologous to the cell in which it is expressed or I 
transgenic non-human animal, whose cells expressed such a 
SSI*!"?* 1^ "^'""^ the cell or animal is subjected to a 
gene activation event that is signalled by exoression of » 
peptide tagged ovine BLG reporte? gene; ^^^""^"1°" a 



2. claims: 1-6, 9-21 (all partially) 



A nucleic acid construct comprising (i) a nucleic acid 
sequence encoding murine major urine protein fMUP) - 
l^^oS?ni°" "° NM 031188) and (ii) a nScleic aiid sequence 
encoding a peptide sequence of from 5 to 250 amino acid 
residues; a host cell with said nucleic acid construct- a 
transgenic non-human animal in which the cells of the non 
human animal express the protein encoded by said nucleic 

5fjlj?"^*T*» "^f.**^ ""cleic acid construct for 
detection of a gene activation event resulting f rom a chanoe 
in an altered metabolic status in a cell n SftrJ^r in 
vivo; a method for the detection of a gene activaJiiS 
event in a cell m vitro or in vivo, comorisina as^avn-nn » 
host cell stably transfected with said Sic fcid^"^ ^ 
construct . wherein said murine MUP is heterologous to the 

1 " "^'"'^ expressed, or a transgenic non-human 

an mal. whose cells expressed such a construct, in which the 

IiaL?fI?i subjected to a gene activation evSl tha? 
rIportS? Jme;^ ^''P''^"'^" °^ ^ P^Pti^e tagged murine MUP 

3. claims: 1-6, 9-21 (all partially) 
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A nucleic acid construct comprising (i) a nucleic acid 
sequence encoding rat alpha-2-urinary globulin (alpha -2u) 
(accession number M27434) and (ii) a nucleic acid sequence 
encoding a peptide sequence of from 5 to 250 amino acid 
residues; a host cell with said nucleic acid construct; a 
transgenic non-human animal in which the cells of the non 
human animal express the protein encoded by said nuclaic 
acid construct; the use of said nucleic acid construct for 
detection of a gene activation event resulting fronm a 
change in an altered metabolic status in a cell in vitro or 
in vivo; a method for the detection of a gene activation 
event in a cell in vitro or in vivo, comprising assaying a 
host cell stably transfected with said nucleic acid 
construct , wherein said rat alpha-2-urinary globulin (alpha 
-2u) is heterologous to the cell in which it is expressed,, 
or a transgenic non-human animal, whose cells expressed such 
a construct, in which the cell or animal is subjected to a 
gene activation event that is signalled by expression of a 
peptide tagged rat alpha-2- urinary globulin (alpha -2u) 
reporter gene; 



4. claims: 7- 8 and partially 9-21 

A nucleic acid construct comprising a stress inducible 
promoter operatively isolated from a nucleic acid sequence 
encoding a member of the lipocalin protein family by a 
nucleic acid sequence flanked by nucleic acid sequence s 
recognised by a sire specific recombinase, or by insertion 
such that it is inverted with respect to the transcription 
unit encoding a member of the lipocalin ptotein family, in 
which the construct additionnally comprises a nucleic acid 
sequence comprising a tissue specific promoter operatively 
linked to a gene encoding the coding sequence for the site 
specific recombinase; a host cell with said nucleic acid 
construct; a transgenic non-human animal in which the cells 
of the non human animal express the protein encoded by said 
nucleic acid construct; the use of said nucleic acid 
construct for detection of a gene activation event resulting 
from a change in an altered metabolic status in a cell in 
vitro or in vivo; a method for the detection of a gene 
activation event in a cell in vitro or in vivo, comprising 
assaying a host cell stably transfected with said nucleic 
acid construct , wherein said lipocalin is heterologous to 
the cell in which it is expressed, or a transgenic non-human 
animal, whose cells expressed such a construct, in which the 
cell or animal is subjected to a gene activation event that 
is signalled by expression of a peptide tagged lipocalin 
reporter gene 



5. claims: 15-16, 18 (all partially) 
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The use of a nucleic acid construct comprising a nucleic 
acid sequence encoding a member of the lipocalin protein 
family, wherein said ipocalin protein is heterologous to 
the cell m which it is expressed, for the detection of a 
gene activation event resulting from a change in altered 
?^ 5h^^S 'V vitro or in vivo; a method 

for the detection of a gene activation event in a cell in 
vitro or in viyo. comprising essaying a host cell stably 
transfected with a nucleic acid construct comprising a 
nucleic acid sequence encoding a member of the lipocalin 
protein family, wherein said lipocalin protein is 
heterologous to the cell in which it is expressed, or a 
transgenic non-human animal, whose cells expressed such a 
construct, in which the cell or animal is subjected to a 

S-H''*r^*^2"i?''^"M''^^* signalled by expression of a 
peptide tagged lipocalin reporter gene, as far as not 
covered by a previous subject; 

6. claim: 18 (partially) 

A method for the detection of a gene activation event in « 
cell in vitro or in vivo, comprising essaying a host cell 
l transfected with a nucleic acid construct comprising 
ov.«+f-®^S ^^^^ sequence encoding a member of the lipocalin 
protein family, wherein said lipocalin protein is 
heterologous to the cell in which it is expressed, or a 
transgenic non-human animal, whose cells expressed such a 
construct, in which the cell or animal is sSbjl??ed So a 

nf[!?^3^*i^«i'S",?''^"*it^^^* signalled by expression of a 
peptide tagged lipocalin reporter gene, as far as not 
covered by a previous subject; • - ^ 
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